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Introduction 



I have written this book mainly for students who will need to apply maths in science or 
engineering courses. It is particularly designed to help the foundation or first year of such 
a course to run smoothly but it could also be useful to specialist maths students whose 
particular choice of A-level or pre-university course has meant that there are some gaps in 
the knowledge required as a basis for their University course. Because it starts by laying the 
basic groundwork of algebra it will also provide a bridge for students who have not studied 
maths for some time. 

The book is written in such a way that students can use it to sort out any individual 
difficulties for themselves without needing help from their lecturers. 

A message to students 

I have made this book as much as possible as though I were talking directly to you about the 
topics which are in it, sorting out possible difficulties and encouraging your thoughts in 
return. I want to build up your knowledge and your courage at the same time so that you are 
able to go forward with confidence in your own ability to handle the techniques which you 
will need. For this reason, I don't just tell you things, but ask you questions as we go along 
to give you a chance to think for yourself how the next stage should go. These questions are 
followed by a heavy rule like the one below. 



It is very important that you should try to answer these questions yourself, so the rule is 
there to warn you not to read on too quickly. 

I have also given you many worked examples of how each new piece of mathematical 
information is actually used. In particular, I have included some of the off-beat non-standard 
examples which I know that students often find difficult. 

To make the book work for you, it is vital that you do the questions in the exercises as 
they come because this is how you will learn and absorb the principles so that they become 
part of your own thinking. As you become more confident and at ease with the methods, you 
will find that you enjoy doing the questions, and seeing how the maths slots together to solve 
more complicated problems. 

Always be prepared to think about a problem and have a go at it - don't be afraid of 
getting it wrong. Students very often underrate what they do themselves, and what they can 
do. If something doesn't work out, they tend to think that their effort was of no worth but 
this is not true. Thinking about questions for yourself is how you learn and understand what 
you are doing. It is much better than just following a template which will only work for very 
similar problems and then only if you recognise them. If you really understand what you are 
doing you will be able to apply these ideas in later work, and this is important for you. 

Because you may be working from this book on your own, I have given detailed solutions 
to most of the questions in the exercises so that you can sort out for yourself any problems 
that you may have had in doing them. (Don't let yourself be tempted just to read through my 
solutions - you will do infinitely better if you write your own solutions first. This is the most 
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important single piece of advice which I can give you.) Also, if you are stuck and have to 
look at my solution, don't just read through the whole of it. Stop reading at the point that 
gets you unstuck and see if you can finish the problem yourself. 

I have also included what I have called thinking points. These are usually more open- 
ended questions designed to lead you forward towards future work. 

If possible, talk about problems with other students; you will often find that you can help 
each other and that you spark each other's ideas. It is also very sensible to scribble down your 
thoughts as you go along, and to use your own colour to highlight important results or 
particular parts of drawings. Doing this makes you think about which are the important bits, 
and gives you a short-cut when you are revising. 

There are some pitfalls which many students regularly fall into. These are marked 



to warn you to take particular notice of the advice there. You will probably recognise some 
old enemies! 

It often happens in maths that in order to understand a new topic you must be able to use 
earlier work. I have made sure that these foundation topics are included in the book, and I 
give references back to them so that you can go there first if you need to. I have linked topics 
together so that you can see how one affects another and how they are different windows 
onto the same world. The various approaches, visual, geometrical, using the equations of 
algebra or the arguments of calculus, all lead to an understanding of how the fundamental 
ideas interlock. I also show you wherever possible how the mathematical ideas can be used 
to describe the physical world, because I find that many students particularly like to know 
this, and indeed it is the main reason why they are learning the maths. (Much of the maths 
is very nice in itself, however, and I have tried to show you this.) 

I have included in some of the thinking points ideas for simple programs which you could 
write to investigate what is happening there. To do this, you would need to know a 
programming language and have access to either a computer or programmable calculator. I 
have also suggested ways in which you can use a graph-sketching calculator as a fast check 
of what happens when you build up graphs from combinations of simple functions. 
Although these suggestions are included because I think you would learn from them and 
enjoy doing them, it is not necessary to have this equipment to use this book. 

Much of the book has grown from the various comments and questions of all the students I 
have taught. It is harder to keep this kind of two-way involvement with a printed book but no 
longer impossible thanks to the Web. I would be very interested in your comments and 
questions and grateful for your help in spotting any mistakes which may have slipped through 
my checking. You can contact me via my website and I look forward to putting little additions 
on the Web, sparked by your thoughts. My website is at http://www.mathssurvivalguide.com 

Finally, I hope that you will find that this book will smooth your way forward and help 
you to enjoy all your courses. 
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Introduction to the second edition 



I have thoroughly revised all the ten chapters in the original edition, both making some 
changes due to comments from my readers and also checking for errors. I've also added a 
chapter on vectors which continues naturally from the present chapter on complex 
numbers. 

I wrote the first version of this new chapter as an extension to the book's website (which 
is now at http://www.mathssurvivalguide.com) building up the pages there gradually. Their 
content was influenced by emails from visitors, often with particular problems with which 
they hoped for help. I've now extensively rewritten and rearranged this material. Writing in 
book form, it was possible to structure the content much more closely than on the Web so 
that it's easy to see the connections between the different areas and how results can be 
applied to later problems. The new chapter also has, of course, many practice exercises with 
complete solutions just as the earlier chapters have. 



I'm once again very grateful to Rodie and Tony Sudbery and to David Olive for their 
helpful suggestions and comments. I must also thank all the people who emailed me, both 
with comments on the original ten chapters, and also with particular needs in using vectors 
which I've tried to fulfil here. 

I hope that this two-way communication will continue. You can email me from the book's 
website if you would like to. Finally, I once again hope that this book will help you and 
encourage you with your studies. 
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Basic algebra: some reminders of 
how it works 



In many areas of science and engineering, information can be made clearer and 
more helpful if it is thought of in a mathematical way. Because this is so, algebra is 
extremely important since it gives you a powerful and concise way of handling 
information to solve problems. This means that you need to be confident and 
comfortable with the various techniques for handling expressions and equations. 
The chapter is divided up into the following sections. 

1.A Handling unknown quantities 

(a) Where do you start? Self-test 1, (b) A mind-reading explained, 

(c) Some basic rules, (d) Working out in the right order, (e) Using negative numbers, 

(f) Putting into brackets, or factorising 

l.B Multiplications and factorising: the next stage 

(a) Self-test 2, (b) Multiplying out two brackets, 

(c) More factorisation: putting things back into brackets 

l.C Using fractions 

(a) Equivalent fractions and cancelling down, (b) Tidying up more complicated fractions, 
(c) Adding fractions in arithmetic and algebra, (d) Repeated factors in adding fractions, 
(e) Subtracting fractions, (f) Multiplying fractions, (g) Dividing fractions 

l.D The three rules for working with powers 

(a) Handling powers which are whole numbers, (b) Some special cases 
l.E The different kinds of numbers 

(a) The counting numbers and zero, (b) Including negative numbers: the set of integers, 

(c) Including fractions: the set of rational numbers, 

(d) Including everything on the number line: the set of real numbers, 

(e) Complex numbers: a very brief forwards look 

l.F Working with different kinds of number: some examples 

(a) Other number bases: the binary system, (b) Prime numbers and factors, 

(c) A useful application - simplifying square roots, 

(d) Simplifying fractions with v ' signs underneath 



1.A Handling unknown quantities 

l.A.(a) Where do you start? Self-test 1 

All the maths in this book which is directly concerned with your courses depends on a 
foundation of basic algebra. In case you need some extra help with this, I have included two 
revision sections at the beginning of this first chapter. Each of these sections starts with a 
short self-test so that you can find out if you need to work through it. 

It's important to try these if you are in any doubt about your algebra. You have to build 
on a firm base if you are to proceed happily; otherwise it is like climbing a ladder which has 
some rungs missing, or, more dangerously, rungs which appear to be in place until you tread 
on them. 
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Self-test 1 

Answer each of the following short questions. 

(A) Find the value of each of the following expressions if a = 3, b = 1, c = 0 and d = 2. 
(1) a 2 (2) b 2 (3) ab + d (4) a(b + d) (5) 2c + 3d 

(6) 2a 2 (7) (2a) 2 (8) 4ab + 3bd (9) a + be (10) d 3 

(B) Find the values of each of the following expressions if x = 2, y = -3, w = 1, v = -2, 
w = A andz = -l. 

(1) 3xy (2) 5vy (3) 2x + 3y + 2v (4) v 2 (5) 3z 2 

(6) w + vy (7) 2x - 5vw (8) 2>> - 3^ + 2z - w (9) 2j 2 (10) z 3 

(C) Simplify (that is, write in the shortest possible form). 

(1) 3p-2q+p + q (2) 3p 2 + 2pq - q 2 - Ipq (3) 5p - Iq - 2p - 3q + 3pq 

(D) Multiply out the following expressions. 

(1) 5(2g + 3h) (2) g(3g-2h) (3) 3k 2 (2k- 5m + 2n) (4) 3k-(2m + 3n-5k) 

(E) Factorise the following expressions. 

(1) 3x 2 + 2xy (2) 3pq + 6q 2 (3) 5x 2 y - Ixy 2 



Here are the answers. (Give yourself one point for each correct answer, which gives a 
maximum possible score of 30.) 

(A) (1) 9 (2) 1 (3) 5 (4) 9 (5) 6 (6) 18 (7) 36 (8) 18 (9) 3 (10) 8 

(B) (1) -IS (2)30 (3) -9 (4)4 (5)3 (6) 10 (7)44 (8) -6 (9) 18 (10) -1 

(C) (1) 4p - q (2) 3p 2 - 5pq - q 2 (3) 3p - \0q + 3pq 

(D) (1) lOg + \5h (2) 3g 2 - 2gh (3) 6k 3 - \5k 2 m + 6k 2 n (4) %k-2m- 3n 

(E) (1) x(3x + 2y) (2) 3q(p + 2q) (3) xy(5x - ly) 

If you scored anything less than 25 points then I would advise you to work through 
Section 1 A. If you made just the odd mistake, and realised what it was when you saw the 
answer, then go ahead to Section LB. If you are in any doubt, it is best to go through Section 
1 A. now; these are your tools and you need to feel happy with them. 

l.A.(b) A mind-reading explained 

Much of what was tested above can be shown in the handling of the following. Try it for 
yourself. (You may have met this apparently mysterious kind of mind-reading before.) 

(1) Think of a number between 1 and 10. (A small number is easier to use.) 

(2) Add 3 to it. 

(3) Double the number you have now. 

(4) Add the number you first thought of. 

(5) Divide the number you have now by 3. 

(6) Take away the number you first thought of. 

(7) The number you are thinking of now is ... 2! 
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How can we lay bare the bones of what is happening here, so that we can see how it is 
possible for me to know your final answer even though I don't know what number you were 
thinking of at the start? 

It is easier for me to keep track of what is happening, and so be able to arrange for it to 
go the way I want, if I label this number with a letter. So suppose I call it x. Suppose also 
that your number was 7 and we can then keep a parallel track of what goes on. 



You Me 

(1) 7 x 

(2) 10 x + 3 (My unknown number plus 3.) 

(3) 20 2(x + 3) = 2x + 6 (Each of these show the doubling.) 

(4) 27 2x + 6 + x = 3x + 6 (I add in the unknown number.) 

(5) 9 ^y-^ = x + 2 (The whole of 3x + 6 is divided by 3.) 

(6) 2 2 (The x has been taken away.) 



Both your 7 and my x have been got rid of as a result of this list of instructions. 
My list uses algebra to make the handling of an unknown quantity easier by tagging it 
with a letter. It also shows some of the ways in which this handling is done. 



l.A.(c) Some basic rules 

There are certain rules which need to be followed in handling letters which are standing for 
numbers. Here I remind you of these. 

Adding 

a + b means quantity a added to quantity b. 

a + a + b + b + b = 2a + 3b. Here, we have twice the first quantity and three times the 
second quantity added together. There is no shorter way of writing 2a + 3b unless we know 
what the letters are standing for. 

We could equally have said b + a for a + b, and 3b + 2a for 2a + 3b. It doesn't matter 
what order we do the adding in. 

Multiplying 

ab means a X b (that is, the two quantities multiplied together) and the letters are usually, 
but not always, written in alphabetical order. 

In particular, a X 1 = a, and a X 0 = 0. 
5ab would mean 5 X a X b. 

It doesn't matter what order we do the multiplying in, for example 3X5 = 5X3. 

Working out powers 

If numbers are multiplied by themselves, we use a special shorthand to show that this is 
happening. 

a 2 means a X a and is called a squared. 
a 3 means a X a X a and is called a cubed. 

a" means a multiplied by itself with n lots of a and is called a to the power n. 

Little raised numbers, like the 2, 3 and n above, are called powers or indices. Using these 
little numbers makes it much easier to keep a track of what is happening when we multiply. 
(It was a major breakthrough when they were first used.) You can see why this is in the 
following example. 
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Suppose we have a 2 X a 3 . 

Then a 2 = a X a and a 3 = a X a X a so a 2 X a 3 = aXaXaXaXa = a 5 . 
The powers are added. (For example, 2 2 X 2 3 = 4 X 8 = 32 = 2 5 .) 



We can write this as a general rule. 

a" X a m = a n+m 

where a stands for any number except 0 
and n and m can stand for any numbers. 



In this section, n and m will only be standing for positive whole numbers, so we can see 
that they would work in the same way as the example above. 

To make the rule work, we need to think of a as being the same as a l . Then, for example, 
a X a 2 = a 1 X a 2 = a 3 which fits with what we know is true, for example 2 X 2 2 = 2 3 or 
2X4 = 8. 

Also, this rule for adding the powers when multiplying only works if we have powers of 
the same number, so 2 2 X 2 3 = 2 5 and 7 2 X 7 3 = 7 5 but 2 2 X 7 3 cannot be combined as a 
single power. 

If we have numbers and different letters, we just deal with each bit separately, so for 
example 3a 2 b X lab 3 = 6a 3 b 4 . 

Working out mixtures - using brackets 

a + be means quantity a added to the result of multiplying b and c. The multiplication of b 
and c must be done before a is added. 

If a = 2 and b = 3 and c = 4 then a + be = 2 + 3X4 = 2+ 12 =14. 

If we want a and b to be added first, and the result to be multiplied by c, we use a bracket 
and write {a + b)c or c(a + b), as the order of the multiplication does not matter. This gives 
a result of 5 X 4 = 4 X 5 = 20. 

A bracket collects together a whole lot of terms so that the same thing can be done to all 
of them, like corralling a lot of sheep, and then dipping them. So a(b + c) means ab + ac. 
The a multiplies every separate item in the bracket. 

Similarly, 2x(x + y + 3xy) = 2x 2 + 2xy + 6x 2 y. The brackets show that everything inside 
them is to be multiplied by the 2x. It is important to put in brackets if you want the same 
thing to happen to a whole collection of stuff, both because it tells you that that is what you 
are doing, and also because it tells anyone else reading your working that that is what you 
meant. Many mistakes come from left-out brackets. 

Here is another example of how you need brackets to show that you want different 
results. 

If a = 2 then 3a 2 = 3X2X2 = 12 but (3a) 2 = 6 2 = 36. The brackets are necessary to 
show that it is the whole of 3a which is to be squared. 



exercise 1.A.1 Try these questions yourself now. 

(1) Put the following together as much as possible. 

(a) 30 + 2b + so + 7C - b - 4c (b) 30b + b + 50 + ib + iba 
(c) 7p + 3pq - 2p + 2pq + 8q (d) 5* + 2y - 3X + xy + 3/ + 2xy 

(2) If a = 2 and b = 1, find 

(a) a 3 (b) 5a 2 (c) (50) 2 (d) b 2 (e) 2a 2 + 3b 2 
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(3) Multiply the following together. 

(a) (2x)(3/) (b) (3x 2 )(5xy) (0)3(20 + 36) (01)20(30 + 5/?) 
(e) 2p(3p 2 + 2pq + q 2 ) (f) 2x 2 (3* + 2xy + y 2 ) 

l.A.(d) Working out in the right order 

If you are replacing letters by numbers, then you must stick to the following rules to work 
out the answer from these numbers. 



(1) In general, we work from left to right. 

(2) Any working inside a bracket must be done first. 

(3) When doing the working out, first find any powers, then do any multiplying and 
dividing, and finally do any adding and subtracting. 



Here are two examples. 

example (1) If a = 2, b = 3, c = 4 and d = 6, find 3a(2d + be) - Ac. 

• Find the inside of the bracket, which is 2X6 + 3 X4=12+12 = 24. 

• Multiply this by 3a, giving 6 X 24 = 144. 

• Find 4c, which is 4 X 4 = 16. 

• Finally, we have 144 - 16 = 128. 

example (2) If x = 2, y = 3, z = 4 and w = 6, work out the value of x(2y 2 -z) + 3w 2 . 

We start by working out the inside of the bracket. 

• Find y 2 which is 9. 

• The bracket comes to2X9-4 = 14. 

• Multiply this by x, getting 28. 

• w 2 = 6 2 = 36 so 3w 2 = 108. 

• Finally, we get 28 + 108 = 136. 

exercise 1.A.2 Now try the following yourself. 

(1) If 0 = 2, b = 3, c = 4, d = 5 and e = 0 find the values of: 

(a) ab + cd (b) ab 2 e (c) ab 2 d (d) (abd) 2 (e) o(b + cd) 

(f) ab 2 d + c 3 (g) ab + d - c (h) 0(6 + d) - c 

(2) Multiply out the following, tidying up the answers by putting together as much 
as possible. 

(a) 3*(2x + 3/) + 4/(x + 7y) (b) 5P 2 (2p + 30) + <7 2 (3P + 59) + Pqip + 20) 
Check your answers to these two questions, before going on. 
Questions (3) and (4) are very similar to (1) and (2) and will give you some 
more practice if you need it. 

(3) If a = 3, b = 4, c = 1, d = 5 and e = o find the values of: 

(a) a 2 (b) 3b 2 (c) (3b) 2 (d) c 2 (e) ab + c (f) bd-ac (g) b{d-ac) 
(h) d 2 - b 2 (i) (d - b) (d + b) (j) d 2 + b 2 (k) (d + b) (d + b) 

(I) o 2 b + c 2 d (m) se(o 2 - 3b 2 ) (n) a» + d° 
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(4) Multiply out and collect like terms together if possible: 
(a) 30(26 + 3c) + 20(0 + 5c) (b) 2xy(3x 2 + 2xy + y 2 ) 
(c) 5p(2p + 30) + 2q(3p + q) (d) 2c 2 (3c + 2d) + 5c/ 2 (2c + d) 



l.A.(e) Using negative numbers 

We shall need to be able to do more complicated things with minus signs than we have met 
so far, so here is a reminder about dealing with signed numbers. 

Ordinary numbers, such as 6, are written as +6 in order to show that they are different 
from negative numbers such as -5. If the sign in front of a number is +, then it can 
sometimes be left out. (We don't speak of having +2 apples, for example.) A negative sign 
can never be left out, in any working combination of numbers. 

One way of understanding how signed numbers work is to think of them in terms of 
money. Then +2 represents having £2, and -3 represents owing £3, etc. 

So using brackets to keep each number and its sign conveniently connected, we have 
for example: 



(+2) + (+5) = (+7) 
(-3) + (-7) = (-10) 
(+4 ) + (-9) = (-5) 
(+3) -(-7) =(+10) 



Ordinary addition. 

Adding two debts. 

You still have a debt. 

Taking away a debt means you gain. 



The same idea carries through to multiplication (which can be thought of as repeated 
addition, so 3 X 2 means 3 lots of 2, or adding 2 to itself three times). 
Some examples are: 



(+2) X (-3)= (-6) 
(-3) X (+5)= (-15) 
(-3) X (-7) =(+21) 



Doubling a debt! 

Taking away 3 lots of 5. 

Taking away a debt of 7 three times. 



The rule for multiplying signed numbers 

Two signs which are the same give plus and two different signs give minus. 



Here are two examples of this in action. 

(1) 3a - 2(b - 2a) + lb = 3a - 2b + 4a + lb = la + 5b. 

(2) 2p - (p + 2q - m). 

Here, you can think of the minus sign outside the bracket as meaning -1, so that when 
the bracket is multiplied by it, all the signs inside it will change. 
We get 2p - p - 2q + m = p - 2q + m. 



exercise 1.A.3 Now try the following questions. 

Multiply out the following, tidying up the answers as much as possible. 
(1) 2x - (x - 2y) + sy (2) 4(30 - 2b) - 6(20 - b) 

(3) 6(2C + d) - 2(3C - d) + 5 (4) 60 - 2(30 - 56) - (0 + 40) 

(5) 3x(2x - yy + 2z) - 4x(2x + $y - 3z) (6) 2xy(3* - 4/) - sxy(2x - y) 
(7) 2a 2 (3o - 2ab) - 50/7(20 2 - 40b) (8) -3p - (p + q) + 2q(p - 3) 
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l.A.(f) Putting into brackets, or factorising 

The process described in the previous section can be done in reverse, so, for example, 

xy + xz = x(y + z). 

This reverse process is called factorisation and x is called a factor of the expression, that 
is, something you multiply by to get the whole answer, just as 2, 3, 4, 6 are all factors of 
12. We can say 12 = 3X4 = 2X6. Each factor divides into 12 exactly. 

Here are three examples showing this process happening. 

(1) 3a 2 + lab = a(3a + lb). This is as far as we can go. 

(2) 3p 2 q + 4pq 2 = pq (3p + 4q) factorising as much as possible. 

(3) 4a 2 b 3 - 6a 3 b 2 = 2a 2 b 2 (2b - 3a) factorising as far as possible. 



It is useful to remember that factorisation is just the reverse process to 
multiplying out. If you are at all doubtful that you have factorised correctly, 
you can check by multiplying out your answer that you do get back to what 
you started with originally. 



Here's an example. 

If you factorise 3c 2 + 2cd + c, which of the following gives the right answer? 
(1) 3c(c + 2^+1) (2) c(3c + 2d) (3) c(3c + 2d + 1). 




Multiplying out gives (1) 3c 2 + 6cd+3c (2) 3c 2 + led and (3) 3c 2 + 2cd + c so (3) is 
the correct one. 

exercise i.A.4 Factorise the following yourself, taking out as many factors as you can. 

(i) 50 + 106 (2) 30 2 + iab (3) 30 2 - dab 

(4) 5xy+8xz (5) sxy-ioxz (6) a 2 b + 30/j 2 

(7) lipq 2 - 6p 2 q (8) 3* 2 y 3 + sx 3 y 2 

(9) 4p 2 q + 2pq 2 - 6p 2 q 2 (10) 2o 2 6 3 + 3a 3 b 2 - 6a 2 b 2 

l.B Multiplications and factorising: the next stage 

l.B.(a) Self-test 2 

This section also starts with a self-test. It is sensible to do it even if you think you don't have 
any problems with these because it won't take you very long to check that you are in this 
happy state. It's a good idea to cover my answers until you've done yours. 

(A) Multiply out the following 

(1) (2x + 3y) (x + 5y) (2) (3a - 5b)(la - b) (3) (3x + I) 2 

(4) (ly-5) 2 (5) (lp 2 + 3pq)(q 2 -lpq) 



l.B Multiplications and factorising: the next stage 
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Factorise the following. 

(B) (1) x 2 + 9x + 14 (2) y 2 + Sy + 12 

(C) (1) 2x 2 + 7x + 3 (2) 3a 2 + 16a + 5 

(D) (l)x 2 +x-2 (2)2a 2 + a-15 
(5) 6y 2 - I9y + 10 (6) 4x 2 - 81j 2 



(3) x 2 + 8x + 16 (4) p 2 + Up + 22 

(3) 3b 2 +106 + 7 (4) 5x 2 + 8x + 3 

(3) 2x 2 + 5x - 12 (4) p 2 - q 2 

(7) 6x 2 - 19x+ 10 (8) 4x 2 - 12x + 9 



As in the first test, give yourself one point for each correct answer so that the highest total 
score is 2 1 . Again, if you got 1 6 or less, work through this following section. 

If you are in any doubt, it is much better to get it sorted out now, because lots of later 
work will depend on it. 

These are the answers that you should have. 

(A) (1)2x 2 + 13xj+15j 2 (2)6a 2 -Uab + 5b 2 (3)9x 2 +12x + 4 



(4) 4j 2 -20j + 25 

(B) (l)(x + 2)(x + 7) 

(C) (l)(2x+l)(x + 3) 

(D) (l)(x + 2)(x-l) 

(5) (3y-2)(2y-5) 



(5) 3pq 3 -4p 3 q-4p 2 q 2 

(2)(y + 2)(y + 6) (3)(x + 4) 2 (4) (p + 2)(p+ 1 1) 

(2)(3a+l)(a + 5) (3) (36 + 7)(6 + l) (4) (5x + 3)(x+ 1) 

(2)(2a-5)(fl + 3) (3)(2x-3)(x + 4) (4) (p-q)(p + q) 

(6) (2x - 9y)(2x + 9y) (7) (3x - 2)(2x - 5) (8) (2x - 3) 2 



l.B.(b) Multiplying out two brackets 

To multiply out two brackets, each bit of the first bracket must be multiplied by each bit of 
the second bracket, so 



(a + b)(c + d) = ac + bd + ad + be. 



The ac + bd + ad + be can be written in any order. 
You could also think of this process, if you like, as 

{a + b)(c + d) = a(c + d) + b(c + d) = ac + ad + be + bd. 

You can see this working numerically by putting a = 1, b = 2, c = 3 and d = 4. 

(a + b)(c + d) = (1 + 2)(3 + 4) = 3 X 7 = 21 

and 

ac + ad + be + bd = 3 + 4 + 6 + 8 = 21. 

Also, you can see that the order of doing the multiplying doesn't matter, since 

ac + 6<i + be + at/ = 3 + 8 + 6 + 4 = 21 too. 

Figure l.B.l shows this process happening with areas, (a + 6)(c + d) gives the total area of 
the rectangle. 




vv::^&-.. ..^f? 



Figure l.B.l 
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Exactly the same system is used to work out (a + b) 2 . We have 

(a + bf = (a + b)(a + b) = a 2 + ab + ab + b 2 = a 2 + lab + b 2 
We can see this working in Figure l.B.2. 



CL b 



a* 


OLD ivf" 













Figure l.B.2 



We can see the two squares and the two same-shaped rectangles. 

0 — 

The diagram shows that (a + b) 2 is not the same thing as a 2 + b 2 . In a similar way, we have 

(a - b) 2 = (a- b)(a - b) = a 2 - lab + b 2 . 
What happens if the signs are opposite ways round, so we have (a + b)(a - b)l 



We get 

{a + b)(a -b) = a 2 -b 2 
because the middle bits cancel out. 
This result is called the difference of two squares. 



You need to be good at spotting examples of this because it is of very great importance 
in simplifying and factorising in many different situations. 

To help you to get good at this, here are some further examples. 

Put back into two brackets (1) x 2 - 9y 2 , (2) 49a 2 - 64b 2 . 

The answers are (1) (x + 3y)(x - 3y) and (2) (la + Sb)(7a - 8b). 

Check these are true by multiplying them back out, and then try the following ones for 
yourself. 

(1) x 2 -y 2 (l)4a 2 -9b 2 (3) I6p 2 - 9q 2 (4) 16a 2 - 15b 2 (5) 36p 2 - 100g 2 



l.B Multiplications and factorising: the next stage 
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These are the answers that you should have. 



(1) (x + y)(x - y) (2) (2a + 3b)(2a - 3b) (3) (4p + 3q)(4p - 3q) 

(4) (4a + 5b)(4a - 5b) (5) (6p + I0q)(6p - lOq) 

In each case, the brackets can equally well be written the other way round since the letters 
are standing for numbers. 

Here is a more complicated example of multiplication of brackets. 

(3x + xy)(xy + y 2 ) = 3x 2 y + x 2 y 2 + 3xy 2 + xy 3 

Again, the basic strategy is the same. Each bit or chunk of the first bracket is multiplied 
by each bit or chunk of the second one. 

(This can be checked by putting x = 2 and y = 3. Each side should come to 180.) 

exercise i.b.i Multiply out the following pairs of brackets. 

(l) (x + 2)(x + 3) (2) (o + 3)(o - 4) (3) (x - 2)(x - 3) 

(4) (p + 3)(2p + 1) (5) (3X - 2)(3x + 2) (6) (2x - 3y)(x + 2/) 

(7) (30 - 2b) (20 - sb) (8) (3X + 4/) 2 (9) (3X - 4/) 2 

(10) (3X + 4y)(3* - 4/) (") (2p 2 + 3Pq)(5P + 3<7) (12) (206 - b 2 )(a 2 - 306) 

(13) (a + b)(a 2 -ab + b 2 ) (14) (o - b)(a 2 + ab + b 2 ) 

(15) Try working through the following steps. 

(a) Think of a positive whole number, and write down its square. 

(b) Add 1 to your original whole number, and multiply the result by the 
original number with 1 taken away from it. 

(c) Repeat this process twice more. 

(d) Describe in words what seems to be happening. 

(e) Must this always happen whatever your starting number is? 

Show that it must by taking a starting number of n so that you can see exactly 
what must happen every time. 

l.B.(c) More factorisation: putting things back into brackets 

Again, the reverse process to multiplying out two brackets is called factorisation. Very often 
it is important to be able to replace a more complicated expression by two simpler 
expressions multiplied together. 

We have already done some examples of this, when we were working with the difference 
of two squares in the previous section. 

What happens, though, if there is a middle bit to be sorted out? 
For example, suppose we have x 2 + Ix + 12. 
Can we replace this expression by two multiplied brackets? 
We would have x 2 + Ix + 12 = (something) (something), and we have to find out what 
the somethings must be. 

We can see that we will need to have x at the beginning of each of the brackets. 
Both signs in the brackets are positive since the left-hand side is all positive, so at the 
ends we need two numbers which when multiplied give +12 and which when added give +7. 
What two numbers will do this? 



+3 and +4 will do what we want, so we can say x 2 + Ix + 12 = (x + 3) (x + 4), giving 
us an alternative way of writing this expression. 
Equally, x 2 + Ix + 12 = (x + 4)(x + 3). 
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The order of the brackets is not important because multiplication of numbers gives the 
same answer either way on. For example, 2X3 = 3X2 = 6. 

In all the questions which follow, your answer will be equally correct if you have your 
brackets in the opposite order from mine. 

exercise 1.B.2 Try putting the following into brackets yourself. 

(1) x 2 + 8x + 7 (2) p 2 + 6p + 5 (3) x 2 + jx + 6 
(4) x 2 + 5X + 6 (5) y 2 + 6y + 9 (6) x 2 + 6x + 8 
(7) o 2 + 70 + 10 (8) x 2 + 9X + 20 (9) x 2 + 13X + 36 

Now, a step further! Suppose we have 2x 2 + 7x + 3 = (something) {something). This time 
we need 2x and x at the fronts of the brackets to give the 2x 2 . If it is possible to factorise 
this with whole numbers then the ends will need 1 and 3 to give 1X3 = 3. 
Do we need (2x + 3)(x + 1) or (2x + l)(x + 3)? 



Multiplying out, we see that 

(2x + 3)(x + 1) = 2x 2 + 5x + 3 which is wrong, 

(2x + l)(x + 3) = 2x 2 + 7x + 3 so this is the one we need. 

exercise 1.B.3 Try factorising these for yourself now. 

(1) 3X 2 + 8x + 5 (2) 2/ 2 + isy +7 (3) 3° 2 + HO + 6 
(4) 3X 2 + 19X + 6 (5) sp 2 + 23P + 12 (6) 5X 2 + i6x + 12 

The system is exactly the same if the expression involves minus signs. Here are two 
examples showing what can happen. 

example (1) Factorise x 2 - lOx + 16. 

Here we require two numbers which when multiplied give +16, and 
which when put together give -10. Can you see what they will be? 



Both the numbers must be negative, and we see that -2 and -8 will 
fit the requirements. This gives us x 2 - lOx + 16 = (x - 2)(x - 8) = 
(x - 8)(x - 2). 



example (2) Factorise x 2 - 3x - 10. 



Now we require two numbers which when multiplied give -10 and 
which when put together give -3. Can you see what we will need? 



This time, to give the -10, they need to be of different signs. 
We see that -5 and +2 will do what we want, so we have 

x 2 - 3x - 10 = (x - 5)(x + 2) = (x + 2)(x - 5). 

Remember that it makes no difference which way round you write 
the brackets. 



l.B Multiplications and factorising: the next stage 
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exercise 1.B.4 Now try factorising the 

(1) x 2 - lix + 24 
(4) p 2 + 5P - 24 
(7) 3* 2 ~ iox - 8 
(10) 3b 2 - 20b + 12 



following yourself. 
(2) y 2 -9Y + 18 
(5) x 2 + 4X - 12 
(8) 20 2 - 30 - 5 
(11) 9X 2 - 25J/ 2 



(3) x 2 - lix + 18 
(6) 2q 2 - 5g - 3 

(9) 2X 2 - 5X - 12 

(12) i6x 4 - 8iy 4 , a sneaky one! 




l.C Using fractions 

Very many students find handling fractions in algebra quite difficult, but it is important to 
be able to simplify these fractions as far as possible. This is because they often come into 
longer pieces of working and, if you do not simplify as you go along, the whole thing will 
become hideously complicated. It is only too likely then that you will make mistakes. 

This section is designed to save you from this. You will find that if you understand how 
arithmetical fractions work then using fractions in algebra will be easy. If you have been 
using a calculator to do fractions, it's likely that you will have forgotten how they actually 
work, so I've drawn some little pictures of what is happening to help you. 

If you think that you can already work well with fractions, try some of each exercise to 
be sure that there are no problems before you move on to the next section. 

Because we are looking here at what we can and can't do with fractions, we shall need 
to use the sign ^. 



The sign ^ means 'is not equal to'. 



l.C.(a) Equivalent fractions and cancelling down 



a 

— means a divided by b. 
b 

a is called the numerator and b is called the denominator. 



In dividing, the order that the letters are written in matters, unlike a X b, which is the 
same as b X a. 
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The order also matters with subtraction; a - b is not the same as b - a unless both a and 
b are zero. But a + b = b + a always. 

2 3 

For example, 2X3 = 3X2 and 2 + 3 = 3 + 2, but 5^5 an d 2-3^3-2. 

a+ b a b 2+3235 

Also, = — 1 — . For example, = — 1 — = — . 

c c c ' 1111 

The whole of a + b is divided by c, and so we can get the same result by splitting this up 
into two separate divisions. The line in the fraction is effectively working as a bracket. 

a + b (a + b) 

In fact, it is safer to write as if it is part of some working. 

c c 

a 

In , the number a is divided by the whole of the number (b + c). 

b + c 

From this, we see that 




You can check this by putting a = 4, b = 2, c = 3, say. 
Dividing by c is the same as multiplying by lie, so 

a + b 1 

= - (a + b). 

c c 

For example, if a = 6, b = 4, and c = 2 then 
6 + 4 , 

— = |(6 + 4) = 5. 

If you find half of 10, it is the same as dividing 10 by 2. 

Fractions always keep the same value if they are multiplied or divided top and bottom by 
the same number, so 

4 8 6 2 

— = — = — = — , etc. 
6 12 9 3 

These are shown in the drawings in Figure l.C.l. 

These four equal fractions are said to be equivalent to each other. The process of dividing 
the top and bottom of a fraction by the same number is called cancellation or cancelling 
down. 




Figure l.C.l 



l.C Using fractions 
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(2\ 4X2 4X2 2 

For example, 4 — = not which is still — . 

\3/ 3 4X3 3 

In words, four lots of two thirds is eight thirds. 

This works in exactly the same way with fractions in algebra. 

So, for example: 

2a 2 

— = — (dividing top and bottom by a) 
5a 5 

xw x 

— = — (dividing top and bottom by w) 

yw y 

2a 3 b 2a 

and — — = — (dividing top and bottom by a b). 
a b b 

Check these three results by giving your own values to the letters. 

When doing this, it is important to avoid values which would involve you in trying to divide 
by zero, because this cannot be done. 

You can use a calculator to investigate this by dividing 4, say, by a very small number, 
say 0.00001. 

Now repeat the process, dividing 4 by an even smaller number. 

The closer the number you divide by gets to zero, the larger the answer becomes. In fact, 
by choosing a sufficiently small number, you can make the answer as large as you 
please. 

If you try to divide by zero itself, you get an ERROR message. 



exercise l.c.i Cancel down the following fractions yourself as far as possible. 

9 6 25 il 5x ab 

(i) — (2) — (3) — (4) - 1 (5) — (6) — 
12 30 95 64 8x ac 

, \ 3/ 2 ^ 8 PV , x 4° 2 , . 3* 2 y 3 , . 6p 2 g 50b 

(7) (8) (9) — do) (11) (12) — 

2y 2q 2ab 2xy 4 SPQ ° 3 



l.C.(b) Tidying up more complicated fractions 

Sometimes, the process of factorising will be very important in simplifying fractions. Here 
are some examples of possible simplifications, and some warnings of what can 't be done. 
If you have always found this sort of thing difficult, it may help you here to highlight the 
matching parts which are cancelling with each other in the same colour. 
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(1) 



(4) 



(5) 



xy + xz x(y + z) y + z 

xw xw w 

dividing top and bottom by x. 



ab + ac a(b + c) 

(2) = = a 

b + c b + c 

dividing top and bottom by the whole chunk of (b + c). 

ab + c 

(3) can't be simplified. 

b + c 

We can't cancel the (b + c) here because a only multiplies b. 

x + xy x(l + y) 1 + y 



x 2 x 2 X 

dividing top and bottom by x. 

x 2 + 5x + 6 (x + 3)(x + 2) x + 3 
x 2 -3x-10 (x-5)(x + 2) x-5 
dividing top and bottom by (x + 2). 



x (x + xy) 

(6) = x(x 2 + xy) 

x 

dividing top and bottom by x. 




x(x 2 + xy) 
It is not true that = x + y. 



This wrong answer comes from cancelling the x twice on the top of the fraction, but only 
once underneath. 

It is like saying |(4)(6) = (2)(3) = 6 but really |(4)(6) = |(24) = 12. 
You can halve either the 4 or the 6 but not both! 




xy + z y + z 
(7) is not the same as . 

xw w 



We cannot cancel the x here because x is only a factor of part of the top. You can check 
this by putting x = 2, y = 3, z = 4, and w = 5. Then 

xy + z 10 y + z 7 

= — =1 and = — 

xw 10 w 5 



l.C Using fractions 
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If we had put x = 1 , the difference would not have shown up, since both 

7 

answers would have been 5. 

This is because multiplying by 1 actually leaves numbers unchanged. 

This example shows that checking with numbers is only a check, and never 
a proof that something is true. 



exercise 1.C.2 Try these questions yourself now. 



(1) Which of the following fractions are the same as each other (equivalent)? 

2 4 12 10 2 6 ax a a{c + d) a 2 x 

( a ) ~> ~> ~> > ~ > ~ w ~ > ~> 77 ~r> ~ 

3 9 18 15 6 9 bx b b{c + a) abx 

ab + ac ab + c b + c x xz xp 

(c) — , -, — — (d) 



ad ad d x + y xz + yz x + yp 

(2) Factorise and cancel down the following fractions if possible. 

, . 2x + 6y 60-96 px-pq 

(a) (b) (c) — 

6x -By i\a - 6b p 2 - px 

3X + 2y 2xy + $xz lxz + 6yz 

(d) (e) (f) ^ y 

6x 6x ix + yy 

/ ^ 2 P " 3° ~* x 2 ~y 2 X 2 + 5X + 6 

(g) (h) (1) 

ip + 30 (x + y) 2 



l.C.(c) Adding fractions in arithmetic and algebra 

It is particularly easy to add fractions which have the same number underneath. 

2 3 5 

For example, 7 + 7 = 7. I've drawn this one in Figure 1.C.2 below. 




Figure 1.C.2 



If the fractions which we want to add don't have the same denominator then we have to 
first rewrite them as equivalent fractions which do share the same denominator. 

2 3 2 8 3 9 
For example, to find — + — we use — = — and — = — . 

3 4 3 12 4 12 
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The two fractions have both been written as parts of 12. The number 12 is called the 
common denominator. It's now very easy to add them, and we have 

2 3 8 9 17 

3 4 12 12 12' 

The answer of % can a l so be written as lf^, but in general, for scientific and engineering 
purposes, it is better to leave such arithmetical fractions in their top-heavy state. 

You should be safe now from the most usual mistake made when adding fractions, which 
is to add the tops and add the bottoms. 



13 1 + 3 

— I — (for example) is not 



6 + 4 10 



We can see that this must be wrong from Figure l.C.3. 




Figure l.C.3 



exercise i.c.3 Since the process in arithmetic is exactly the same as the process we use to add 

fractions in algebra, it is worth practising adding some numerical fractions 
yourself without using a calculator, before we move on to this. 
Try adding these three. 

32 24 124 

(1) - + - (2) - + - (3) - + - + - 
4 7 3 5 235 

The letters work in exactly the same way as the numbers. We can say 

a c ad be ad + be 

b d bd bd bd 

where a, b, c and d are standing for unknown numbers, and neither b nor d are zero. We have 
written both fractions as parts of bd to make it easy to add them. 
Indeed, we can say 

AC AD BC AD + BC 



B D BD BD BD 

where A, B, C and D are standing for whole lumps or chunks of letters and numbers. 
As an example of this, we will find 

x + 2y 3x + 2y 

+ . 

x - y x + 3y 



l.C Using fractions 
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Here, A = x + 2y, B = x - y, C = 3x + 2y and D = x + 3y. So we have: 

(x + 2y)(x + 3y) (3x + 2y)(x - y) (x + 2y){x + 3y) + (3x + 2y)(x - y) 
(x - y)(x + 3y) (x + 3y){x - y) (x - y)(x + 3y) 

x 2 + 5xy + 6y 2 + 3x 2 - xy - 2y 2 
(x - y)(x + 3y) 

4x 2 + 4xy + 4y 2 4(x 2 + xy + y 2 ) 
(x - y)(x + 3y) (x - y)(x + 3y) 

We don't usually multiply out the brackets on the bottom, because then we might miss a 
possible cancellation. (This saves you some work.) 

3x -2 2x - 3 

Try combining + into a single fraction, yourself. 

x + 3 x + 1 



The working should go as follows: 

(3x - 2)(x + 1) (2x - 3)(x + 3) (3x - 2)(x + 1) + (2x - 3)(x + 3) 

+ = 

(x + 3)(x +1) (x + l)(x + 3) (x + 3)(x + 1) 

3x 2 + x - 2 + 2x 2 + 3x - 9 
(x + 3)(x + 1) 

5x 2 + 4x- 11 
(x + 3)(x + 1) ' 

(Remember that the order in which we multiply the brackets doesn't matter.) 

l.C.(d) Repeated factors in adding fractions 

Sometimes, the addition is a little easier because there is a repeated factor. Here's a 
numerical example of this. 

3 5 

— + — has a repeated factor of 2 underneath. 

4 6 

So, instead of saying 

3 5 18 20 38 19 

4 6 24 24 24 12 
we can say more directly 

3 5 9 10 19 

4 6 12 12 12' 

The number 12, which is the smallest number which both 4 and 6 will divide into, is called 
the lowest common denominator or l.c.d. for short. 



This same simplification applies to fractions in algebra. 
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EXAMPLE (l) 



x(x + 3) x(2x - 1) 

There is a repeated factor of x underneath, so we say 
2 2(2x - 1) 



and 



So 



x(x + 3) x(x + 3)(2x - 1) 
3 3(x + 3) 



x(2x - 1) x(2x - l)(x + 3) 

2 3 2(2x - 1) + 3(x + 3) 

x(x + 3) x(2x - 1) x(x + 3)(2x - 1) 

7x + 7 7(x + 1) 



x(2x - l)(x + 3) x(2x - l)(x + 3) 

You can follow through this example experimentally, converting it into arithmetical fractions 
by putting in some value of your choice for x. 

Be careful though! There are three values which you mustn't choose. Can you see what 
they are? 



You can't have x = 0 or x = -3 or x = j, because each of these values would involve 
trying to divide by zero, which is impossible as we saw at the end of Section l.C.(a). 

In this example, it would not have been wrong to put everything over the common 
denominator of x(x + 3)x(2x - 1) or x 2 (x + 3)(2x - 1). It would just have taken longer to 
work out. 

2x 3y 

EXAMPLE (2) + 

y(3x - 2y) 4x(3x - 2y) 

Here, (3x - 2y) is a repeated factor underneath, so the expression is 
equal to 

(2x)(4x) 3y(y) _ 8x 2 + 3y 2 

y(3x - 2y)(4x) 4x(3x - 2y)(y) 4xy(3x - 2y) ' 

Check this example by putting x = 4, y = 2 and z = 5. 



You should get 

8 6 8(16) + 6(2) _ 128 + 12 _ 140 _ 35 
2(8) 16(8) 32(8) 256 256 64' 

8(16) + 3(4) 128 + 12 140 35 
32(8) 256 256 64' 
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exercise i.c.4 Try these for yourself. 



27 5 3 13 5 

(1) - + — (2) - + - (3) - + - + - 

9 15 6 8 346 

3x5V 25 43 

(4) , + , , (5)- r + ^^ (6)-^- + 



y(2x - y) x(2x - y) x(yc + 1) x(2x - 1) x 2 - y 2 (x + y) 2 



l.C.(e) Subtracting fractions 

Subtraction works in exactly the same kind of way as addition, so, for example 

2 5 2X8 5X3 16 15 1 

3 8 3 X 8 8 X 3 24 24 24 ' 
In just the same way, 

a c ad cb ad - be 
b d bd db bd 

where a, b, c and d are standing for numbers such as the 2,3,5 and 8 we had in the first 
example. 

Equally, just as in adding fractions, we can say that 
A C AD-BC 
B D BD 
where A, B, C and D stand for any chunks of letters and numbers. 

^^^^ The line in a fraction works in the same way as a bracket. If we are adding 
^B-^V fractions this won't affect what happens, but if we are subtracting them we 
have to be careful. For example, suppose we have 

4x - 3 2x + 1 

2 3 

The minus sign in the middle is affecting the whole right-hand chunk. We 
can show this most safely by rewriting using brackets. Then we have: 

(4x - 3) (2x + 1) 3(4x - 3) 2(2x + 1) 

2 3 3X2 2X3 

3(4x - 3) - 2(2x + 1) 
6 

12x-9-4x-2 
6 

_ 8x - 1 1 
6 

The safest strategy is always to put the brackets in, because then they will be 
there on the occasions when their presence is vital. 
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EXERCISE 1.C.5 



Try these mixed additions and subtractions yourself. 



3x- 5 2x- 3 
(1) + 



(3) 



(5) 



10 15 
3m - 5/7 3m - jn 



(2) 



30 + 56 0-36 



2/3 30 

(4) " + 



20 36 
+ 



(o + b)(3o + b) (a - b)(3a + b) 



(6) 



0(20 + b) 0(20 + b) 

5 2 
x 2 - y 2 x(x + /) 



l.C.(f) Multiplying fractions 

This is very straightforward. (It is much easier than adding!) We simply say 



a c ac 
- X - = — . 
b d bd 



That is, we multiply the tops, and multiply the bottoms. 

We can take f x 5 = n = 5 asa numerical example of what's happening. If you take 
two thirds of three quarters, you get one half. I show this happening in Figure l.C.4. 




Figure l.C.4 



lfA,B, C and D are standing for any chunks of letters and numbers, 

A C AC 

then we can say — X — = . 

B D BD 



It may then be possible to cancel down, for example 

x(b + c) y xy (b + c) 1 

X = = — 

y 2 x 2 (b + c) x 2 y 2 (b + c) xy 

dividing top and bottom by xy(b + c). You should always cancel down the answer like this 
if it is possible. The reason for this is that often fractions like this come in as part of the 
working out of a larger problem, and it pays to simplify them as much as possible before 
going on to the next step, to make that next step as easy as possible for yourself. 



l.C Using fractions 
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You can also do the cancelling before you do the multiplying if you want; I show the 
working done this way in Figure l.C.5. Cancellations are usually shown by diagonal lines. 
Notice that, when everything on the top cancels, we finish up with 1 not 0. 



Figure l.C.5 



l.C.(g) Dividing fractions 

The rule for dividing fractions is to turn the second fraction upside down and then 
multiply. 



a c a d ad 
b d b c be 



We can see that this works by taking the numerical example of one and one half divided 
by one half. We get 

3 13 2 ! 
— + — = — X — = 3 (that is, there are three halves in 1 -j). 
2 2 2 1 

exercise i.c.6 Now try these questions, cancelling down your answers where possible. 

2 3 2X - 1 X - 7 

(l) (2) ~ 

x(2x - 3y) 2x(x + 4/) 3 5 

30 2 ab 20 b 2 3X 2x 2 

(3) (a) — x — (b) — — (c) ~ 



ib 6c 30 90 2 y 2 z $yz 2 

3X 2 (2X + 3/) y 2 (x - y) spq(p + q) (3p + 2q) 

(4) (0) — X — (b) X 



2/ (x - y) x(x + 3y) (3p + 20) q 2 (sp - q) 

(a 2 - b 2 Y (o 4 - b 4 ) 

(c) X Be cunning! 

(o 2 + b 2 ) (0 + b) 4 

l.D The three rules for working with powers 

l.D.(a) Handling powers which are whole numbers 

It will be useful for us now to spend some time looking in more detail at how numbers 
written as powers of other numbers can be combined with each other. (We have already 
looked briefly at the rules for multiplying such numbers in Section l.A.(c).) 

We'll use the four numbers 8 = 2 3 , 32 = 2 5 , 9 = 3 2 and 81 = 3 4 as examples. 

We could combine these numbers in many ways, some of which I have written down here. 

(1) 32 X 8 (2) 9 X 81 (3) 32 ^ 8 

(4) 81-9 (5) 8 X 9 (6) 81 32 (7) 8 2 
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If we rewrite the numbers as powers, we get the following results. 

(1) 32 X 8 = 2 5 X 2 3 = (2 X 2 X 2 X 2 X 2) X (2 X 2 X 2) = 2 5 +3 = 2 8 = 256. 
The answer to the multiplication can be obtained by adding the powers. 

(2) Similarly, 9 X 81 = 3 2 X 3 4 = (3 X 3) X (3 X 3 X 3 X 3) = 3 2+4 = 3 6 = 729. 
Again, the result can be obtained by adding the powers. 

2X2X2X2X2 

(3) 32 = 8 = 2 5 = 2 3 = = 2X2 = 2 5 ~ 3 = 2 2 = 4 

2X2X2 

This time, the answer has been obtained by subtracting the powers. 

3X3X3X3 

(4) Similarly, 81 = 9 = 3 4 - 3 2 = = 3X3 = 3 2 = 9 

3X3 

and again the result is obtained by subtracting the powers. 

(5) 8 X 9 = 2 3 X 3 2 . This time, the calculation is made no easier by writing the 
numbers in this form. As they are powers of different numbers, we cannot use the 
same system as we did in (1) and (2). 

Returning to the original form, 8 X 9 = 72. 

(6) Similarly, there is no advantage to be gained by writing 81 = 32 as 3 4 = 2 5 . 
81 

— can be left like this, or written in decimal form as 2.53125. 
32 

(7) 8 2 = (2 X 2 X 2) 2 = (2 X 2 X 2) X (2 X 2 X 2) = 2 6 and 8 2 = (2 3 ) 2 = 2 6 . 
The answer comes from multiplying the two powers. 

Any powers which are whole numbers will work in the same kind of way, so we will now 
write down the three rules or laws for working with powers. 



The three rules for powers 




Rule (1) 


a m X a" = a m+n 






Example: a 2 X a 3 = 


(a X a) X (a X a X a) = a 5 . 


Rule (2) 


a m = a" = a m -" 








aXaXaXaXa 




Example: a 5 = a 2 = 


- a 3 . 






a X a 


Rule (3) 


{a m ) n = a mn 






Example: (a 2 ) 3 = (a 


X a) X (a X a) X (a X a) = a 6 . 



We saw from the numerical examples that we must have powers of the same number for 
these rules to work. There, we used either 2 or 3, and for the rules above I have used a. The 
number a is called the base that we are working with. 
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l.D.(b) Some special cases 

It can be shown that the three rules above are true for any values of m and n, provided that 
a * 0, but it is not possible for us to prove this yet. However, by using powers which are 
whole numbers we can see how some particular cases will have to go. 

a X a X a 

(1) a 3 - a 2 = = a and, by Rule (2), a 3 + a 2 = a 3 ' 2 = a 1 . 

a X a 

So we must have 
a 1 = a. 



a X a X a 

(2) a 3 - a 3 = = 1 and, by Rule (2), a 3 - a 3 = a 3 ~ 3 = a 0 . 

a X a X a 

So we must have 
«° = 1. 



(3) a 2 - a 3 = — — = - and by Rule (2), a 2 - a 3 = a 2 ' 3 = a~\ 

a X a X a a 

So we must have 



a 

1 

In fact, more generally, a " = — . 

a" 



(4) a m X a 112 = a x by Rule (1), and a 1 = a. 

So a 1/2 is the number which multiplied by itself gives a. 

a 112 means the square root of a. 



Similarly, a 1 ' 3 X a' /3 X a m = a 1 by Rule (1). 

So a 1/3 means the cube root of a, or 3 [a. 

and a 1/n means the «th root of a or a 17 " = "{a. 

Here are four examples. 

What are (1) 4 1/2 (2) 8 1/3 (3) 27 2/3 (4) 16 1/4 ? 
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(1) 4 means the square root of 4, so it means the number which multiplied by itself 
gives 4. There are two numbers which do this. What are they? 



They are + 2 and -2. So 4 1/2 = +2 or -2. 
We can write this as 4 1/2 = ±2. (The symbol ± means + or -.) 

(2) 8 1/3 means the cube root of 8 so it means finding a number a so that a X a X a = 8. 
What can a be? 



There is only one possible value for a in ordinary numbers, which is +2. 

(I say 'ordinary numbers' here because it is possible to extend the number system so that 
other possibilities open up. In fact, as we shall see in Chapter 10, we then rather pleasingly 
get three cube roots. But for the present, we are only interested in solutions in ordinary 
numbers.) 

(3) 27 2/3 = (27 1/3 ) 2 by Rule (3). But 27 1/3 = 3 so (27 1/3 ) 2 = 3 2 = 9. 

(4) 16 1/4 means the fourth root of 16. What are the possibilities here? 



There are two possibilities using ordinary numbers. 

We have 2X2X2X2=16 and -2 X -2 X -2 X -2 = 16 so 16 1/4 = ±2. 

In general we can say that each even root of a positive number has two possible solutions, 
and each odd root of either a positive or a negative number has just one solution. 

At present, we cannot find any even roots of negative numbers, although in Chapter 10 
we will find out how it is possible to extend the number system so that we can have roots 
for these numbers too. Have a guess at how many fourth roots of 16 we shall then have. 



Yes, it is most satisfyingly four. 

exercise i.d.i It is very useful to get a feeling for what these powers do, so that you can quickly 

recognise alternative ways of writing them, or possible simplifications. 

Try these numerical examples without a calculator to help you develop this feel. 

Then go through, checking all your answers on your calculator. If you have a 
mismatch, try to spot which one has gone wrong. Maybe the answers are the 
same but just in a different form? (Your calculator will only give you positive 
values for roots; you have to add possible alternative negative answers yourself.) 
Make sure that you know how powers work on your calculator; read its little 
instruction book if necessary! 

(i) 3" 1 (2) i6 l/2 (3) 9 3/2 (4) 2 7 " 1/3 (5) 4° (6) 7 1 

(7) 7~ 2 (8) 4" l/2 (9) 32 1/5 (10) i6" 3M (n) 2s^' 2 (12) 49" l/2 
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l.E The different kinds of numbers 

The number system has been invented and extended as people needed ways to describe ever 
more complicated situations and transactions. This procedure took thousands of years, so I 
have to compress it somewhat in this brief description. 

l.E.(a) The counting numbers and zero 

By inventing names, with symbols for those names, it became possible to count how many 
distinct objects there were when they were collected together. It was also then possible to 
count the totals when collections were combined together, provided enough names or 
symbols had been invented. 

Having a symbol for zero was a great advance. The oldest written record with a symbol 
for zero dates from the ninth century in a Hindu manuscript. 

We don't very often have to say that we have none of something. So why is having a 
symbol for zero so important? 



It makes it possible to put in all the necessary place values in our system for writing 
numbers, for example 301. Having a place value system means that once the symbols for 1 
to 9 are learnt, a number of any size can be written. This use of the symbol for zero was 
ridiculed by some people when it was first adopted. How could it be possible to write a large 
number, they said by using lots of symbols which each individually stand for nothing? 

The fact that it took two centuries before this symbol for zero was invented shows what 
a subtle development it was. 

l.E.(b) Including negative numbers: the set of integers 

The first important extension to the system of counting numbers for a collection of objects 
is having some arrangement to represent what happens if we want to take away more than 
we have, so that we owe. 

If we include the negative numbers we can do this. 

We now have the number system of integers given by 

... -4, -3, -2, -1, 0, 1, 2, 3, 4, ... 

The German mathematician Kronecker said of these numbers: 'God made the whole 
numbers; everything else is the work of man.' 
Also now we have a nice symmetry. 

For every number there is another number so that put together they make zero, so each 
number has its matching pair. These pairs of numbers are reflections of each other around 
zero. What are the pairs of (a) +7, (b) -9, and (c) 0? 



(a) +7 has the pair -7. (b) -9 has the pair +9. (c) 0 is its own pair. 

Putting together any two numbers in this system gives us another number in the system. 
It has a nice completeness about it. 

l.E.(c) Including fractions: the set of rational numbers 

The next major extension to the number system results from the requirement of being able 
to divide quantities up. To do this, we have to include fractions, that is, numbers which 
can be written in the form alb where a and b are integers or whole numbers, excluding 
the case when b = 0. These numbers are called the rational numbers. Then the integers 
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themselves come from the special case in which b = 1, so they are included in this 
description. 

We can now divide quantities into smaller amounts, even if the numbers involved mean 
that the result of the division is not a whole number (provided of course that the quantity 
concerned is physically divisible into non-integer amounts). 

We have a second nice symmetry here, this time about 1 . 

For every number except zero, there is now another number so that multiplied together 
we get 1 . For example, j has the pair 

What are the pairs of (a) f, (b) -§ and (c) 1? 



(a) 7 has the pair 3. (b) -5 has the pair -5. (c) 1 is its own pair. 

Putting together any two numbers in this system by multiplying them together gives us 
another number in the system, so we have exactly the same sort of completeness that we had 
above with adding. The two systems have the same underlying structure of each number 
having its own individual partner so that each pair together gives a special number, zero in 
the case of adding and 1 in the case of multiplying. 

If we put little tiny points for the value of each possible fraction on a number line how 
close will these points be together? Will there be any gaps? 

Suppose we have two fractions F l and F 2 which are very close together, say F x = jqq and 
F2 = T6T- 

Then, there must be at least one fraction which lies between these two. Can you think of 
one? 



There are lots of possibilities for this. In particular, we could take {F l + F 2 )/2 . 

201 

This is exactly midway between F 1 and F 2 . Here, it would be -fojoo . 
This system of insertion can be infinitely repeated, so we see that there can't be any 
spaces between these fractions. 

l.E.(d) Including everything on the number line: the set of real numbers 

If the fractions are packed infinitely closely together, where is y2? 

Is it a fraction? Trying a few possibilities doesn't look very promising, but maybe we just 
haven't got the right numbers. 

Suppose that it is possible, and we have found a and b so that 

a a 2 

— = v'2 so — = 2 
b b 2 
and therefore 

a 2 = 2b 2 . 

We'll also suppose that any possible cancelling down of the fraction alb has already been 
done, so it is tidied up as much as possible. 
What kind of number must 2b 2 be? 



It must be even, so a 2 must be even as well. 

What happens if you square (a) even numbers (b) odd numbers? 
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An even number squared gives another even number and an odd number squared gives 
an odd number. We can show this by writing even numbers as 2n (with n standing for any 
whole number) and odd numbers as 2n + 1. 

Then (2n) 2 = An 2 and (2n + l) 2 = An 2 + An + 1. 

Because of this, we see that the number a must be even. We could call it 2a i to show 
this. Then 

a 2 = (2a 1 )(2a 1 ) = Aa\ = 2b 2 which means that b 2 = 2a\. 

Now, by the same argument as before, b must also be even, so a and b could have been 
cancelled down. 

But if we cancel them, we can use exactly the same argument to show that they would 
cancel down again, and so on for ever. 

So there is no fraction which is exactly equal to /2. 

This argument is due to the Pythagoreans of Ancient Greece. They were disconcerted and 
alarmed by such numbers, which they called 'incommensurable'. There is a story that the 
first Pythagorean to show their existence was thrown into the sea for his pains. 

In fact, yfl is somewhere between j^j and {^jjj. So although the fractions are packed 
infinitely closely, there are still gaps where the numbers like yf2, yfl, etc. are. 

(This is one of the mysteries of maths and is because infinite numbers of things behave 
in very peculiar ways.) 

These numbers, together with ji and similar numbers, are called irrational numbers. The 
rational and irrational numbers together are called the set of real numbers. 

Here's another example of how infinite quantities of things behave in unexpected 
ways. 

If we have two collections or sets of objects and we can tally off each object in the first 
set with a corresponding object in the second set and vice versa, like knives and forks in 
place settings, then the two sets must have an equal number of objects in them. 

Or must they? 




Suppose we start with the two lines meeting at O which I have drawn above in Figure 
l.E.l, and we then draw parallel lines like AP and BQ so that points is matched with point 
P and point B is matched with point Q. All the points on the two lines can be paired off in 
this way, so the two lines must be equal in length. But clearly they are not! We can no longer 
say that the sets are equal because now there are an infinite number of objects involved and 
the usual rules no longer apply. 
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l.E.(e) Complex numbers: a very brief forwards look 

Finally, to make the list complete, we will jump ahead of ourselves briefly. We know that 
2X2 = 4 and -2 X -2 = 4. So the square root of 4 is +2 or -2. But we have no number 
for the square root of -4. 

In Chapter 10, we shall find out how it is possible to extend the number system even 
further so that we can have an answer for ^-4. In fact, even better, we get two answers, just 
like y[4 has two answers. 

We get this extension by including the so-called imaginary numbers. The real and 
imaginary numbers together form the set of complex numbers. 

l.F Working with different kinds of number: some examples 

l.F.(a) Other number bases: the binary system 

We have to use ten symbols for writing numbers because our counting system is based on 
10. Our whole system is therefore called the decimal system, although in ordinary speech 
we use 'decimals' for just the fractions written in this system. 

However, other bases can be used. One of the most important of these is the system based 
on 2, the binary system. This involves counting in place values given by powers of 2 instead 
of powers of 10. 

So, for example, 

324 in the decimal system = 4(10°) + 2(10 1 ) + 3(10) 2 = 4 + 2(10) + 3(100). 

11001 in the binary system = 1(2°) + 0(2 ! ) + 0(2 2 ) + 1(2 3 ) + 1(2 4 ) 

= 1 + 0(2) + 0(4) + 1(8) + 1(16) 

= 1 + 8 + 16 = 25 in the decimal system. 

Notice that, in each case, we have processed the number from right to left, instead of from 
left to right. 

In each case, we wrote down the number of units, the number of 'tens', the number of 
'hundreds', etc., where the 'ten' or 10 of the binary system is 2, the 'hundred' or 100 of the 
binary system is 2 2 or 4, and so on. 

Counting in binary goes 1, 10, 11, 100, 101, 110, 111, 1000, etc. instead of the decimal 
1, 2, 3, 4, 5, 6, 7, 8, etc. 

The binary system only requires two symbols to write, those for one and zero, which is 
why it is so important. The separate digits of numbers written in this system can be 
represented by electric current either flowing or not flowing in a circuit, and therefore 
numbers can be handled in this form by computers. 

exercise i.f.i Try converting these three binary numbers into decimal numbers for yourself. 

(l) 10111 (2) 1111 (3) 111011 

How can we go the other way, and convert decimal numbers into binary numbers? 
If we have the number 109, say, we could do it just by splitting it up into powers of two. 

109 = 64 + 45 = 64 + 32 + 13 = 64 + 32 + 8 + 5 

= 64 + 32 + 8 + 4 + 1 = 2 6 + 2 5 + 2 3 + 2 2 + 1 

= 1101101 in binary or base 2. 

(A useful way of showing that this number is in base 2 is to write the 2 as a little 
subscript, so we write the number as 1101101 2 .) 
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This is good for seeing what is happening, but not so good as a standard method of 
conversion. 

What we have actually done here is to split the number up into progressively higher 
powers of 2, which we can do equally well by repeatedly dividing it by 2, recording the 
remainder at each stage so we get the smaller powers as they shed off. 

I show the working for this below. 



Remainder 



2 


109 




2 


54 


1 


2 


27 


0 


2 


13 


1 


2 


6 


1 


2 


3 


0 


2 


1 


1 




0 


1 



T 



The answer is: 

109 10 is the same as 1 101 101 2 . 



EXERCISE 1.F.2 



Try converting these three decimal numbers to binary numbers for yourself, 
(l) 72 (2) 2431 (3) 3251 



If you have the use of a computer and know a programming language, you 
could write a program to do this, since the process of dividing by 2 is a 
repeated loop until the number being divided is itself less than 2. You just have 
to record the remainders so that you can display or print out your binary 
conversion at the end. 



etc. 



This system works equally well in other number bases. 
For example, in base 8, we have a 'ten' of 8 and a 'hundred' of 8 2 , 
So 237 8 = 7 + 3(8*) + 2(8 2 ) = 7 + 24 + 128 = 159 10 . 
Working the other way round is done by repeated division by 8. 
So, for example, to convert 397 10 into base 8, you would do the working shown below. 



397 



49 



397 



10 



Remainder 



5 

6 1 
0 6 
= 615*. 



T 



Check: 615 8 = 5 + 1X8 
= 397 10 . 



6 X 8 2 
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l.F.(b) Prime numbers and factors 

In this section, we look briefly at how the different numbers are built up. 

Many numbers can be written as products (i.e. multiplications) of smaller numbers or 
factors in quite a few different ways, for example 

12 = 2X6 = 2X3X2 = 3X4 = 12 XI. 

Numbers which have no factors other than themselves and one are called prime 
numbers. No smaller number (except for 1) will divide into them exactly. 7, 1 1 and 19 are 
all examples of prime numbers. 

Are there any even prime numbers? 



Every even number can be divided exactly by 2, so there is just one even prime number, 
which is 2 itself. 

Every number can be written as a product of its prime factors, so for example 
15 = 3 X 5 and 12 = 2 2 X 3. 

Mathematicians have shown that every number can only be broken down into a product 
of prime factors in one way, so, if we split 126 as 2 X 3 2 X 7, we don't have to worry that 
maybe it could also be split so that it has some completely different prime factors. 

Is there a pattern for how prime numbers slot into the other numbers? Figure 1 .F. 1 shows 
all the prime numbers between 1 and 50, as shaded squares. 
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Figure l.F.l 

It doesn't look as though there is a pattern, although we do notice that many of them seem 
to come in pairs with just one number in between. We also see that, as we go down through 
the numbers, we are getting more and more possible prime factors for the numbers which 
we haven't yet reached. Does this mean that after a while we will have collected all the 
building blocks that we need to make future numbers, so that there will no longer be any new 
prime numbers? 

The answer to this question is that we will never have enough building blocks to make 
all the possible future numbers. Given any prime number, however large, it is always possible 
to find at least one larger one. 

We can show that this is true in the following neat way. 

We start by taking a numerical example, because it is easier then to explain how the 
argument goes. 
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Suppose we think that 23 might be the largest prime number. (I have deliberately chosen 
quite a small number here. It is, in fact, easy to find larger prime numbers than 23, but it 
will do very nicely to show how the general argument goes.) First, we list all the prime 
numbers up to 23. (We don't normally include the number 1 in these - 1 is its own special 
unique case of a number.) Doing this gives us 2, 3, 5, 7, 11, 13, 17, 19 and 23 itself. 

Next, we use all these prime numbers to write down a new number. This new number is 

(2 X 3 X 5 X 7 X 11 X 13 X 17 X 19 X 23) + 1. 

What kind of number is this? None of the prime numbers up to 23 will divide into it 
exactly, because each of these divisions would leave a remainder of 1 . So either it is itself 
a prime number, or it has prime factors which are larger than 23 . 

Either way round we have shown that there must be at least one prime number which is 
larger than 23, and we could use this argument in exactly the same way to show that if we start 
with any prime number N, then there must be at least one prime number larger than N. 

This very nice ingenious method is due to Euclid, a mathematician from Ancient 
Greece. 

l.F.(c) A useful application - simplifying square roots 

We can often use a number's prime factors to simplify its square root. 
For example, 

VT2 = x 2 X 2 X 3 . but ^2X2 = 2, so we can say Jl2 = 2^3. 

Here is another example. 

V72 = (2X2X2X3X3 = \ 2 X 2'' X 3 2 2 X 3 X x 2 6 x 2. 

When square roots appear as part of a long calculation, it often makes things much easier 
if you rewrite them like this. Using a calculator to find them is often not very helpful in mid- 
calculation because it frequently gives you a string of decimals which is very awkward to 
handle. 

exercise 1.F.3 Try some for yourself now. Simplify these numbers in the same way. 

(i) X 28 (2) V45 (3) V 50 (4) n 44 fc) n 63 (6) x 40 

l.F.(d) Simplifying fractions with v signs underneath 

In Section l.E.(d), I showed that y2 is irrational. Most square roots are irrational, the 
exceptions being numbers such as 2 = V4, 6 = y 36, etc. Numbers such as 4 and 36 are 
called perfect squares. 

If we have a number made up of two separate bits, one of which is rational and one of 
which is irrational, like 3 + y'5, then the combined number will be irrational. 

But the matching pair of numbers of 3 + y'5 and 3 - y'5 have two rather nice 
properties. 

We can see the first of these by adding them. 

This gives us (3 + y\5) + (3 - y' 5) = 6. (We have lost the irrational part.) 
Can you see what other good possibility we have? 



Multiplying them together also works very nicely. 
We get (3 + V5) (3 - V5) = 9 + 3V5 - 3 V5 - 5 = 4. 
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This is another application of the ubiquitous difference of two squares. (We have also 
used (V5) 2 = 5.) 

Fractions such as 5/(2 - y 3 ) are particularly unwelcome because they involve dividing by 
a number which is partly rational and partly irrational. We can get round this problem in the 
following way. 

5 5(2 + y'l) 

2 - V3 (2 - V3) (2 + V3) 

multiplying top and bottom of the fraction by 2 + y[3. This gives 

10 + 5>/3 10 + 5^3 

= 10 + 5J3. 



(2) 2 -(V3) 2 4-3 

We have cleverly got the y signs on the bottom to cancel out, by multiplying the fraction top 
and bottom by (2 + y'3). Then we use the fact that (y 3) 2 = 3. 

3 - V2 

As another example, we will simplify — — . 



The denominator (or underneath number) is particularly unpleasant this time. 

Can you see what we could multiply by to get rid of the y' signs on the bottom? Look 
again at the previous example if necessary. 



We multiply the top and the bottom by (y5 + y'2) and get: 

(3 X 2)( N 5 • N 2) 3 N 5 2 x 10 • 3 X 2 3 N 5 2 \ 1 0 3 N 2 

(V5 - y'2) (V5 + y/2) 5-2 3 

It may help you to recognise references to this process if you know that this process of 
removing the \ s on the bottom is called rationalising the denominator. Numbers like \[2 
are called surds. 

We shall use exactly this process in Chapter 10 to simplify complex numbers. 

exercise i.f.4 Try simplifying these three for yourself. 

, , 5 3 - VI , , 3 - 2^3 

W (2) F (3) p 

3 + \2 3 + V5 5 + 3x2 
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Graphs and equations 



In this chapter we look at different ways of solving equations. We shall do this both 
by using the algebra from the first chapter and also by seeing what the solutions we 
find mean when we look at them graphically. 

The chapter is split up into the following sections. 

2.A Solving simple equations 

(a) Do you need help with this? Self-test 3, (b) Rules for solving simple equations, 

(c) Solving equations involving fractions, 

(d) A practical application - rearranging formulas to fit different situations 
2.B Introducing graphs 

(a) Self-test 4, (b) A reminder on plotting graphs, 

(c) The midpoint of the straight line joining two points, (d) Steepness or gradient, 

(e) Sketching straight lines, (f) Finding equations of straight lines, 

(g) The distance between two points, 

(h) The relation between the gradients of two perpendicular lines, 

(i) Dividing a straight line in a given ratio 

2.C Relating equations to graphs: simultaneous equations 

(a) What do simultaneous equations mean? 

(b) Methods of solving simultaneous equations 

2.D Quadratic equations and the graphs which show them 

(a) What do the graphs which show quadratic equations look like? 

(b) The method of completing the square, 

(c) Sketching the curves which give quadratic equations, 

(d) The 'formula' for quadratic equations, 

(e) Special properties of the roots of quadratic equations, 

(f) Getting useful information from 'fa 2 - 40c', 

(g) A practical example of using quadratic equations, 

(h) All equations are equal - but are some more equal than others? 

2.E Further equations - the Remainder and Factor Theorems 

(a) Cubic expressions and equations, (b) Doing long division in algebra, 

(c) Avoiding long division - the Remainder and Factor Theorems, 

(d) Three examples of using these theorems, and a red herring 



2.A Solving simple equations 

2.A.(a) Do you need help with this? Self-test 3 

In the first chapter, we revised the various methods for using the rules of algebra to handle 
and simplify unknown quantities. We now see how we can use these rules to find 
information from different kinds of equation. In case you need to be reminded how to solve 
simple equations, I have put in another self-test here. As before, if you are in any doubt about 
how much you remember, you should try the test now because it is much easier to go 
forward happily if any problems are sorted out at the beginning. 
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Self-test 3 

Answer each of the following 
standing for in each case. 



short questions by 



finding the 



value which the letter is 



(1) 


x + 7 = 4 


(2) 


3v = 27 




(3) 


5y= 12 


(4) 


2/7 + 3 = 8 


(5) 


2a + 3 = 


5a - 2 


(6) 


10-26 


(7) 


3(2x - 1) = 2(2x + 3) 


(8) 


x 3 
4 5 




(9) 


3x 5 
8 9 


(10) 


8 

- = 2 

X 


(11) 


1 

2x + - = 
2 


3 
5 


(12) 


5 3 

y 7 


(13) 


x + 1 

- 5 

2 


(14) 


2y + 3 _ 
4 


5 


(15) 


2v + 1 
3 


(16) 


3x 

— + 3 = x - 5 
5 


(17) 


2x 

— -3 = 
3 


X 


(18) 


5 


2 


3a -2 


(19) 


3 2 


(20) 


2 


5 






p + 3 p+4 


2a + 1 


3a - 2 







_ J + 3 
2 

= 3 



Save your working on this test because I shall do most of these questions as examples, and 
you will be able to compare what you did with my solutions. Indeed, you might find as we go 
through that you can change some to make them right before you look at my version. 

If your present answers are right, give yourself one mark each for questions (1) to (10), 
and two marks each for questions (1 1) to (20), so the test has a possible total of 30 marks. 
If you have less than 25 marks, you should work through the next section. Remember that 
if you are in any doubt about your handling of these equations, it is best to get the difficulties 
sorted out straight away. 

The answers to the test are as follows: 

(1) x = -3 (2) y = 9 (3) y = f (4) p = § (5) a = f 

(6) b = 1 (7) x = § (8) x = y (9) x = f n (10) x = 4 

(ll)x = 55 (12)j = f (13)x = 9 (14) y = % (15) y = 1 

(16) x = 20 (17) x = 18 (18)a = y (19) p = -6 (20) a = -\. 



2.A.(b) Rules for solving simple equations 

Since the two sides of an equation are equal, in general you are safe if you do the same thing 
to each side. For example, the equation is still true if: 

• we add the same amount to each side; 

• we subtract the same amount from each side; 

• we multiply both sides by the same amount; 

• we divide both sides by the same amount, remembering that we must not try to divide 
by zero. (See the end of Section 1 .C.(a) for what happens then.) 

We can use these rules to simplify equations to the point where it is easy to see the 
solution. 
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Here is an example: 



3x + 17 =x + 7. 
Taking 17 from both sides gives 

3x = x + 1 - 17, 
so 3x = x - 10. 
Taking x from both sides gives 

2x = -10. 
Dividing both sides by 2 gives 

x = -5. 

We see from this example that adding or subtracting the same amount from each side has the 
same effect as shifting bits from one side of the equation to the other provided that we 
change the signs from + to - or - to + as we do so. 

We can now check the solution we have found by putting it back into the original 
equation. If it is correct then the two sides should indeed be equal, so we look at each side 
in turn. It is helpful to have a shorthand for this, and I shall use LHS to stand for the left- 
hand side and RHS to stand for the right-hand side. 

Here, putting x = -5, the LHS = 3 X -5 + 17 = 2, and the RHS =-5 + 7 = 2 also. 

As further examples, here are the solutions of the first seven questions of Self-test 3. 

(1) x + 7 = 4sox = 4-7 = -3. 

27 

(2) 3y = 27 so y = — = 9 (dividing both sides by 3). 

12 

(3) 5y = 12 so y = — (dividing both sides by 5). 

(4) 2p + 3 = 8 so 2p = 8 -3 = 5 and p = f . 

(5) 2a + 3 = 5a - 2 so 3 + 2 = 5a - 2a = 3a and a = f . 

(Notice, it was easier to rearrange here so that we had a positive number of the 
unknown amount.) 

(6) 10 - 2b = b + 7 so 10 - 7 = b + 2b = 3b and b = 1. 

(7) 3(2x - 1) = 2(2x + 3) so 6x - 3 = 4x + 6 so 2x = 9 and x = \. 



exercise 2.A.1 Try these for yourself now. The best method is to do what you comfortably can in 

your head, without chopping out so many steps that mistakes begin to creep in. 
Check that all your answers fit their equations. 

(l) x + 8 = 5 (2) sy = 40 (3) 2/ = 7 

(4) 7 + 2x = 5 - x (5) 4 + 2b = 5b + 9 (6) 3(x - 3) = 6 

(7) 3O-2) = 2(y-i) (8) 2(30-1) = 3(40 + 3) (9) 3x-i = 2(2x-i) + 3 
(10) 2(p + 2) = 6p-3(p-4). 



2.A.(c) Solving equations involving fractions 

I think that the easiest way to solve this kind of equation is to start by getting rid of the 
fractions. We can do this by multiplying both sides of the equation by a number chosen so 
that, after cancelling, we have only whole numbers to deal with. 
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I shall now use some further questions from Self-test 3 as examples of this, 
x 3 

<8) VI 

Multiplying both sides of the equation by 4 X 5 = 20, and cancelling, gives 

12 

5x = 4 X 3 = 12 so x = — . 

5 

1 3 

(11) 2x + - = - 

2 5 

Multiplying both sides by 2 X 5 = 10 gives 

10 (2x + j) = 10 X | so 20x + 5 = 6 so x = ^. 

Notice that I used a bracket to make sure that every separate piece of the original equation 
got multiplied by 10. 

5 3 

(12) - = - 

y i 

Multiplying both sides by 7y gives 

35 

7 X 5 = 3y so y = — . 

3 

This has the same effect as doing a sort of cross-multiplying of bottoms to tops. It is fine 
to use this method so long as you only do it for equations with single fractions each side. 
It wouldn't work for (11), for example. 

2y + 3 
(14) -^— = 5 
4 

Multiplying both sides by 4 gives 

17 

2y + 3 = 20 so 2y = 17 and y = 



(15) 



2 

2y + 1 y + 3 



3 2 

Multiplying both sides by 3 X 2 = 6 gives 

2(2y + 1) = 3(y + 3) so 4y + 2 = 3y + 9 and y = 7. 

2x x 
(17) — -3=- 
3 2 

Multiplying both sides by 3 X 2 = 6 gives 

/ 2x \ x 
6 — -3 =6X- so 4x-18 = 3x and x = 18. 
V 3 ) 2 



It is important to remember that the -3 also gets multiplied by the 6. Again, 
I've used a bracket to make clear that this is what I must do. 
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(18) ^T = 3 
3a - 2 

Multiplying both sides by (3a - 2) and cancelling on the left-hand side gives 
5 = 3(3a - 2) so 5 = 9a - 6 so 1 1 = 9a and a = y. 

2 5 

(20) = 

2a + 1 3a - 2 

Multiplying both sides by (2a + 1) (3a - 2), and cancelling, gives 

2(3a - 2) = 5(2a +1) so 6a - 4 = 10a + 5 so -9 = 4a and a = -\. 

My last example involves three fractions. Solve 
2x+l 3x - 2 x - 1 



3 4 6 

What should we multiply by to get rid of the fractions this time? 



Did you think of 3 X 4 X 6 = 72? This will do, but we could use the more delicate 
instrument of 12 since 3, 4 and 6 are all factors of 12. 

^^^^ This equation has a tricky bit which often leads to mistakes. Can you see 
^B-^V what it is? It was mentioned as a warning in Section l.C.(e). Try the next 

step yourself before looking at what I've done to see if you can avoid this 

pitfall. 



3x -2 2x + 1 

The whole of is being subtracted from . 

4 3 

The line of the fraction is acting in the same way as a bracket, and it is safest to put brackets 
round each fraction chunk to keep the working clear and the signs correct. 
Then, multiplying through by 12, we have 

'2x+l\ /3x-2\ fx -I 
121 - 12 = 12 



Cancelling each fraction in turn, we get 

4(2x + 1) - 3(3x - 2) = 2(x - 1) so 8x + 4 - 9x + 6 = 2x - 2 

(Leaving out the brackets could mean that you would wrongly have a -6 in this last 
equation.) 

So 4 + 6 + 2 = -8x + 9x + 2x therefore 12 = 3x and x = 4. 
Checking back, the LHS = f - -j = \ and the RHS = f = \. 



42 



Graphs and equations 



It is important that we can only get rid of fractions by multiplying if we are 
dealing with an equation. It will not work if we just have an expression such as 



4 x + 3 

— + . 



2 5 

Here we would have no justification for making this 10 times larger. 
The best we can do is to simplify as we did in Section l.C.(c). Then 

x + 4 x + 3 5(x + 4) 2(x + 3) 5(x + 4) + 2(x + 3) Ix + 26 

+ = + = = . 

2 5 10 10 10 10 



I've put in quite a lot of detail in these examples so that you can see exactly what's 
happening. As you get more confident, you'll find you probably don't need to write down 
all the steps. This is fine, but it's a good idea to check your answers to make sure that they 
do fit the given equations. 



exercise 2.A.2 Try these questions for yourself now. 

Solve each of the following equations. 

5* ix xx 

(l) — = 2 (2) 5 + x = — (3) = 1 

3 3 3 4 

, . y 3/ - 7 y ~ 2 3m - 5 9 - 2m 

(4) — (5) = 0 

3 5 6 43 

x - 1 x - 2 p + 13 23 

(6) 1 (7) (8) — 

23 p-14 y y + 1 

43 ix 3X 
(9) = (10) = 1 

2X + 3 X - 2 X+2X+5 
2X + 1 x+5 3X-1 X+3 X-l 2X-1 

(11) + — - = - — (12) — 

327 4 5 10 



2.A.(d) A practical application - rearranging formulas to fit different situations 

We can also use the rules for solving equations to rearrange formulas so that they are in a 
more convenient form to use in changed situations. 



example (1) The formula 




gives the period T of a pendulum of length /. The period is the length of 
time for a complete to-and-fro swing, jt is the ji of circles, and g stands 
for the acceleration due to gravity. 



2.A Solving simple equations 



If we want to find the length of a pendulum which has a given 
period, it would be more convenient to have the formula rearranged so 
that the length / is given in terms of the other quantities. This is 
sometimes called changing the subject of the formula to /. We have 



T= 2jt« 



I 



Since the two sides of an equation are equal, they must still be equal if 
we square both of them. Therefore 

/ 



T 2 = 4jt 2 



g 



(Notice that everything must be squared, including the 2jt.) So now we 
have 

gT 2 

I = (multiplying both sides by g and dividing by 4jz 2 ) 

An 2 

and this gives us the new formula we wanted. 

example (2) For this, I'll take the formula relating the distance u of an object from a lens 
of focal length / to the distance v of its image from the lens. This is 

1 1 1 

u v f 

Suppose you want to find the distance of the image from the lens for 
certain given distances of the object from the lens; you need a formula 
for v in terms of u and / 



Students sometimes think that they can go through the equation above turning 
everything upside down and it will still be true. This is not so! 



1 1 1 

It is true that - + - = - but 3 + 6*2. 
3 6 2 

Remember, it is only possible to turn both sides of an equation upside 
down if there is just one fraction on each side. For example we can say that 

2 4 3 6 
— = — so — = — . 

3 6 2 4 

What do you think we should do to help us rearrange 
1 1 1 
u v f 
if we can't turn it all upside down? 
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We can get rid of all the fractions by multiplying both sides of the 
equation by uvf. Then we have 

^/(I + i) = „„/(!) 

so, cancelling down, 
vf + uf= uv. 

We want a formula for v, so we put everything with a v in it on the 
same side of the equation. This gives uf= uv - vf so, factorising, 
uf= v(u -f). Now, dividing both sides by (u - f), we have 

uf 
v = 

" -/ 

which gives us the new formula for v that we wanted. 

We shall use exactly these same techniques for shifting stuff around when we find inverse 
functions in Section 3.B.(h). 

exercise 2.A.3 Try some rearranging of actual formulas for yourself now. 

(1) The surface area, 5, of a sphere of radius r is given by the formula 5 = t\nr 2 . Its 
volume, V, is given by V=\nr 3 . Rearrange these two formulas to give (a) the 
radius in terms of the surface area, and (b) the radius in terms of the volume. 

(2) The volume, V, of a closed cylinder of radius r and height h is given by the 
formula V = nr 2 h. Its surface area 5 is given by S = mr 2 + inrh. Rearrange 
these two formulas to give (a) the height in terms of the radius and the 
volume, and (b) the radius in terms of the height and the volume, and (c) the 
height in terms of the radius and the surface area. 

(3) v 2 = u 2 + 20s is a formula which relates the final velocity v to the initial 
velocity u of a body which travels a distance s with constant acceleration a. 

Find (a) a formula for a in terms of u, v and s, and (b) a formula for u in 
terms of v, a and s. 

(4) If two resistances, R^ and R 2 , in an electric circuit are arranged in parallel then 
they are equivalent to a single resistance R, with the relation between them 
being given by the formula 

111 
R R 1 R 2 

Find a formula which will give the value of R 2 in terms of R and R lt in the form 
R 2 = . . . Use this formula to find out what resistance should be put in parallel 
with a resistance of 3 Q to give an effective resistance of 2 Q. (Q is the symbol 
used for ohms, the unit in which resistance is measured.) 

2.B Introducing graphs 

It can be very helpful when thinking about how equations work if we can show them 
graphically, so that we can see what is happening in another way. I shall start by considering 
equations which can be shown by straight lines. This section is here in case you need any 
reminders on how to handle straight line graphs. I have put in another self-test here, so that 
you can see if you need to work through this. 



2.B Introducing graphs 
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2.B.(a) Self-test 4 

Try answering each of the following questions. 

(1) What are the coordinates of the midpoints of the straight lines joining 
(a) (2, -1) and (8, 5) (b) (-3, 1) and (2, -8)? 

(2) What is the steepness or gradient of the straight lines joining 
(a) (2, 5) to (8, 17) (b) (-1, 3) to (8, -6)? 

(3) What are the gradients of the following straight lines? 

(a) y = 3x + 4 (b) y + 4x = 2 (c) 2y = x - 4 (d) 3y + 4x = 0. 

(4) Find the equations of the following straight lines: 

(a) with gradient 2 and passing through (1,3) 

(b) with gradient -1 and passing through (2, -1) 

2 

(c) with gradient 5 and passing through (2,4) 

(d) passing through (2, 5) and (8, 10) 

(e) passing through (-4, -2) and (-1, 5). 

(5) What is the distance between each of the two pairs of points given in the first 
question? (Give your answers to two decimal places or d.p.) 

(6) Find the equations of the straight lines which pass through (1, 4) and are 
perpendicular to (a) y = 2x + 5 (b) 3y + 2x = 1 (c) Ay + x = 0. 

(7) What are the coordinates of the point which divides the straight line joining the 
points (1, 3) and (6, 18) in the ratio 2 : 3? 



Here are the answers which you should have. 

Give yourself one mark for each correct part of (1), (2) and (3), and two marks for each 
correct part of (4), (5), (6) and (7). 

(1) (a) (5, 2) (b) (4, -|) 

(2) (a) 2 (b) -1 

(3) (a) 3 (b) -4 (c) \ (d) -f 

(4) (a)j = 2x+l (b)j + x=l (c) 3j = 2x + 8 (d) 6> = 5x + 20 (e)3>> = 7x + 22 

(5) (a) v'72 = 8.49 to 2 d.p. (b) /106 = 10.30 to 2 d.p. 

(6) (a) 2y + x = 9 (b) 2y = 3x + 5 (c) y = 4x (7) (3, 9) 

As with the other self-tests, if you have less than 25 marks you should certainly work 
through this next section. Each particular point is dealt with here in the same order as the test 
questions, so it is also possible to go directly to any particular area where you need help. 

2.B.(b) A reminder on plotting graphs 

Here is a brief reminder of how graph plotting works. Suppose we have the equation y = 2x + 3 . 
Then, for each value of x that we might choose, there will be a corresponding value of y. The 
values of y depend on the values of x, and we call y the dependent variable and x the 
independent variable. We could show some of these pairs of values in a table, as below. 



X 


-2 


-1 


0 


1 


2 


3 


y 


-1 






5 




9 



Fill in the three missing y values yourself. 
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You should have 1, 3 and 7. 

We can write these pairs of values grouped together as (-2, -1), (-1, 1), (0, 3), (1, 5), 
(2, 7) and (3, 9). The independent value always comes first, and belongs to the variable 
which is plotted from side to side on a piece of graph paper, using the horizontal axis. 
The dependent variable is plotted from top to bottom, using the vertical axis. Because it 
matters what order we write these pairs of numbers in, they are often called ordered 
pairs. 

To plot them, we mark out a piece of graph paper with suitable scales to include all of 
the points which we are interested in. 

The point (0, 0) where the axes cross is called the origin. 

If the point P is (2, 7) then the numbers 2 and 7 are called the coordinates of P. 2 is its 
x-coordinate and 7 is its y-coordinate. 

The scales do not have to be equal. Here, it was more convenient to make the scale on 
the _y-axis smaller, and we get a graph which looks like the one in Figure 2.B.I. 




-2 



Figure 2.B.1 



It is important always to label the axes of your graphs with the letters of the variables you 
are using, so here I have labelled them x and y. 

I have joined the points with a straight line. I've done this because I am thinking that for 
every value of x there is a corresponding value of y, and all these points together make the 

3 3 

line. (For example, if x = then y = 6 and (5, 6) is also a point on the line.) 

When you plot a graph accurately on graph paper, you should use a well-sharpened 
pencil to mark each point with a small cross as accurately as you can. Then, if it is a straight 
line, draw this through the points in pencil. Of course, for any particular straight line, you 
only need to find two points, but it is always safer to work out three because this allows you 
to check your arithmetic if they turn out not to be in line. 

2.B.(c) The midpoint of the straight line joining two points 

To show this, I shall draw two diagrams for you. Figure 2.B.2(a) shows the special case of 
(l)(a) from the Self-test, and Figure 2.B.2(b) shows two general points which I shall call 

(x 1 ,y 1 ) and (x 2 ,y 2 ). 
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Figure 2.B.2 



If you find this at all difficult, I think it will help you to get a feeling of exactly what is 
going on if you use different colours on the two differently dashed lines. It may also help you 
to understand how everything fits in if you write in the measurements for the separate bits 
yourself. 

The midpoint in each case is found by taking the half-way or average value of the x values 
at either end of the line, and then doing the same for the y values. 
The midpoint in (a) is 



2 5 + (-l) 



which is (5, 2). 

The midpoint in (b) is 

'x x +x 2 y x +y 2 
2 ' 2 

We can now use this to find the midpoint of the line joining (-3, 1) and (2, -8). (This was 
question (l)(b).) We let (-3, 1) be y x ) and (2, -8) be (x 2 , y 2 ), which gives us the 
midpoint as 

-3 + 2 1 + (-8)\ f-l -7 
or 



2 2 

It would have worked equally well if we had taken (x Y , y x ) as (2, -8) and (x 2 , y 2 ) as (-3, 1). 
(Try it and see.) 

(If you have any problems with putting together the positive and negative numbers, you 
should go back to Section l.A.(e) in the first chapter. It will also help you if you make your 
own drawings of the pairs of points and their midpoints. Then you can actually see how the 
numbers are combining together to work.) 



The midpoint of the line joining (jc l5 y x ) and (x 2 , y 2 ) is given by 

'xi +x 2 y x +y 2 
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exercise 2.B.1 Find the coordinates of the midpoints of the straight lines joining these pairs of 

points. 

(1) (-3, 2) and (i, -6) 

(2) (-2, -1) and (3, 4) 

(3) (-1, -5) and (-4, -6) 

2.B.(d) Steepness or gradient 

Straight lines have the same steepness or gradient all the way along. This gradient can be 
measured by the distance moved vertically in the y direction for a unit distance moved from 
left to right in the x direction. If the line goes uphill from left to right so that this vertical 
distance is being measured in the positive direction up the v-axis, then the gradient is 
positive. If the line goes downhill from left to right then the vertical distance and the gradient 
are negative. We could think of the gradient as telling us the rate of change of y as x 
changes. 

Figure 2.B.3(a) shows the line joining (2, 5) and (8, 17) (question 2(a) from Self-test(4)), 
and Figure 2.B.3(b) shows the line joining the two points (x l5 y r ) and (x 2 , y 2 )- 




Figure 2.B.3 



The gradient in (a) is given by 

distance up 12 

— = — = 2 

distance along 6 



The gradient in (b) is given by 

distance up y 2 - y\ 



distance along x 2 - x x 

The gradient of a straight line is often written as the single letter m. Using this, we can now 
write down the following formula: 



The gradient, m, of the straight line joining (x ls Vi) to (x 2 , y 2 ) is given by 

yi-y\ 



m = 



X 2 — X\ 
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The m gives us the measure of how y is changing relative to x. We have already seen that 
the linej = 2x + 3 has a gradient of 2, with_y increasing twice as fast as x. Similarly, the line 
y = mx + c has a gradient of m. Rewriting the equation of any straight line in this form 
enables us to read off its gradient. For example, in question (3) of Self-test 4, the line (a), 
y = 3x + 4, has a gradient of 3. 

Line (b), y + 4x = 2, can be rewritten as y = -Ax + 2 so m, the gradient, is -4. 

Line (c), 2y = x - 4, can be rewritten as y = \x - 2 so m = \. 

4 -4 

Line (d), 3 y + 4x = 0, can be rewritten as y = -jx so m = — . 



We said in the previous section that if the equation of a straight line is written in the form 
y = mx + c them m is its gradient. What does the value of c tell us? 



If we put x = 0 we get y = c so the point (0, c) is where the line cuts the j-axis (its y 
intercept). For example, the line y = 2x + 3 cuts the _y-axis at (0, 3). 

If we know the values of m and c, we can use these to draw a sketch of the line. Figure 
2.B.4 shows three examples with sketches of (a) y = 3x + 1 so m = 3 and c = 1, 

3 

(b) y + x = 2 so y = -x + 2 and m = -1 and c = 2, (c) 4y = 3x + 4 so j = j x + 1 
and m = 4 and c = 1. 



EXERCISE 2.B.2 



Find the gradients of the following straight lines. 



(1) / = 3 - 5* 

(2) 2/ = 3X + 7 

(3) 3/ + x = 1 

(4) 4/ - 5X = 2 



2.B.(e) 



Sketching straight lines 





. 1 VY 

O 2 4^ 



<S> 3 + * = 2 



© 4j= 3x+4 



Figure 2.B.4 



EXERCISE 2.B.3 



Each of the following sketches in Figure 2.B.5 fits one of the lines whose 
equations are given below. Pair each equation up with its correct sketch. 
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Figure 2.B.5 

(1) y = x (2) y + 4* = 4 (3) 4/ = x + 4 (4) y = x - 2 
(5) y = 2x (6) y = x + 2 (7) y = \x (8) y + 2x = -2 



How can we write the equations of the lines shown in the four sketches in 
Figure 2.B.6? 



^3 



->x 



-2 



Figure 2.B.6 



The first sketch shows a line every point of which has a y-coordinate of 2, so it can be 
written as y = 2. (The value of x can be anything you like, since you can choose any point 
on this line.) Similarly, the second sketch shows y = -3. What do the third and fourth 
sketches show? 



The third sketch shows x = 3 and the fourth sketch shows x = -2. 
The lines in the first two sketches are flat, so their gradient, m, is zero. 
We can't write down the gradient for the last two lines because they are infinitely steep 
and we can't divide by zero. 
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2.B.(f) Finding equations of straight lines 

How much do you need to know to distinguish a particular straight line from all the other 
possible straight lines? 



You would either have to know two points which lie on it, or one point on it and its 
gradient. It is useful to be able to write down the equation of a straight line from either of 
these two starting positions. 

Figure 2.B.7 shows a straight line with gradient m passing through two known points which 
I have called (x 1 , y x ) and (x 2 , y?). We take (x, y) to be any general point on this line. 




Figure 2.B.7 



We have 

yi-y\ y-y\ 

= = m. 

JC\ X 

Two useful forms for the equation of a straight line come from this. 



Form (1) 


y- 


-y\ = 


m(x 


-x t ) 




Form (2) 


y 


-y\ 


x - 


-*\ 


yi 


-y\ 


x 2 


-x 1 



Form (2) comes from rearranging 

J2 -y\ _ y-yi 

X 2 — X\ X — X\ 

in the same way that we can rearrange 

8 6 6 9 

— = — as — = — . 
12 9 8 12 

example (1) Find the equation of the line with gradient j which passes through (3, 2). 
Substituting in form (1) gives y - 2 = \ (x - 3) so 2y = x + 1. 
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example (2) Find the equation of the line passing through (3, 2) and (9, 5). 
Substituting in form (2) gives 
y - 2 x - 3 

= so 6(y - 2) = 3(x - 3) and 2y = x+l. 

5-2 9-3 

Notice that this is the same line as we got from the first example. The reason for this is that 
I have chosen the points (3, 2) and (9, 5) because they fit nicely on Figure 2.B.7 above. If 
you have found any difficulty with the general rules in the two boxes above, you can feed 
these numbers in and mark the different numerical distances on the diagram to help you. 

For completeness, I also include the equation of a straight line written in the form 
y = mx + c which we have already used in Section 2.B.(d). This gives us 



Form (3) y = mx + c. 



Writing the numerical example of 2y = x + 1 in the form y = 2 x + 5, we have m = 5 and 

c = i 

exercise 2.B.4 Have another go at question (4) from Self-test 4 if you couldn't do it earlier. You 

should be able to do it now. 

2.B.(g) The distance between two points 

Suppose we need to find the distance D between the two points (x r , y x ) and (x 2 , j 2 ) as I nave 
shown in Figure 2.13.8(a). 




Figure 2.B.8 



We use Pythagoras' Theorem which says that: 



The distance between the two points (x l5 y { ) and (x 2 , y 2 ) is given by 

D 2 = (y 2 -y l ) 2 + (x 2 -x l ) 2 
so D = V(j>2 -y\f + (*2 -xif- 
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In the numerical example of Figure 2.B.8(b), this will give us 
D = V(4- l) 2 + (6-2) 2 = V3 2 + 4 2 = V25 = 5. 
(Pythagoras' Theorem is shown to be true in Section 4.A.(b).) 

exercise 2.B.5 Try question (5) of Self-test 4 again if you couldn't do it earlier. 



2.B.(h) The relation between the gradients of two perpendicular lines 

If we know the gradient of a line, surely it must be possible to write down the gradient of 
a line perpendicular to it. Suppose we start with the line y = \x. What is the gradient of any 
line perpendicular to this? 



We can see the way in which we can find the answer to this question by looking at Figure 
2.B.9 below. Figure 2.B.9(a) shows the special case of line (1) being y = \x and Figure 
2.B.9(b) shows the general case of line (1) having a gradient of p/q = m l , say. (I have only 
shown where the two lines cross each other in the two diagrams.) 




Figure 2.B.9 



In diagram (a), line (2) has a gradient of -2/1 = -2. (The minus sign is because the 2 is 
being measured downwards.) In diagram (b), line (2) has a gradient m 2 of -qlp. 
We see that the gradients of the two perpendicular lines multiplied together give 

1 

- X -2 = p/q X -qlp = -1. 



If two lines with gradients m, and m 2 are perpendicular, then m l m 2 = -1. 



exercise 2.B.6 Do question (6) from Self-test 4 again if you couldn't do it earlier. 



2.B.(i) Dividing a straight line in a given ratio 

In Section 2.B.(c) we found that the midpoint of the line joining (x l ,y l ) and (x 2 ,y 2 ) is given by 

*i +x 2 y 1 + y 2 \ 
2 ' 2 j 
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We now look at how to find the coordinates of a point which divides a line in any 
proportion or ratio. 

Figure 2.B. 10(a) shows the special case of question (7) of Self-test 4, where we are 
looking for the point which divides the straight line joining the points (1, 3) and (6, 18) in 
the ratio 2:3. 

Figure 2.B. 10(b) shows the point (x,y) which divides the straight line joining (x 1 ,y 1 ) to 
(x 2 , y 2 ) in the ratio p : q. We shall use this to find a general formula. 




Figure 2.B.10 



In (a), the point P is t of the way along line AB so each of its x- and ^-coordinates is given 

2 

by moving on from A by j of the total change from A to B. 

So we could say that P is given by (1 + j (6 - 1), 3 + 5 (18 - 3)) which is (3, 9). 
Similarly, we can see in (b) that P is given by 

P P \ 

*\ + (x 2 -x 1 ),y 1 + (J2-J1) • 

p + q p + q I 

This looks rather clumsy. Perhaps we can make it nicer if we put the whole of each 
coordinate over (p + q). Then we get 

_P_ ^ _ X! (p + q) +p(x 2 -x 1 ) 

p + q p + q 

_ x x q + x 2 p 
p + q 

and, similarly, the y coordinate of P is 

y\Q + yip 

p + q 

This gives us a much neater form for the coordinates of P. 
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The point P which divides (x x , y { ) and (x 2 , y 2 ) in the ratio p : q is given by 

/x l q+x 2 p y r q+y 2 p\ 
\ p + q ' p + q j 



Putting p = q in this formula gives us the same formula for the midpoint that we quoted 
at the beginning of this section. (Try it yourself, putting p = q = 1, and also p = q = 3, 
say.) 

When p and q are different from each other, they adjust the position of the point P by 
separately multiplying x x and x 2 , and y 1 and y 2 . 

^^^^ Notice that /; and q flip over so that it is g which multiplies x x and /; which 
^B-^V multiplies x 2 . 



example (1) If we use this formula to give the answer to question (7) of Self-test 4, 
shown in Figure 2.B. 10(a), we get 

/ 1 X 3 + 6 X 2 3 X 3 + 18 X 2\ 

P is given by — , — = (3, 9). 

V 2+3 2+3 / 



exercise 2.B.7 Find the coordinates of the points which divide 

(1) the line joining (-1, 2) and (5, 14) in the ratio 2 : 1, 

(2) the line joining (-2, -3) and (6, 9) in the ratio 1:3. 



2.C Relating equations to graphs: simultaneous equations 

2.C.(a) What do simultaneous equations mean? 

We now have two ways in which we can look at equations. We can find ways of solving them 
using algebra and we can also see what the meaning of these solutions is graphically. 
We will use this double approach first on pairs of equations like the following: 

2x + 3j = 5 (1) 

x - 2y = 6 (2) 

These are two equations which are true together, so that we have two pieces of information 
about the two unknowns, x and y. 

Such pairs of equations are called simultaneous equations. 

We could show these as two straight lines on a graph sketch. (See Figure 2.C.I.). To draw 
this sketch, I have rearranged 2x + 3y = 5 as y = -3 x + 5 and x - 2y = 6 as y = 2 x - 3. 
Then we can see that there is just one possible pair of values for x and y which fit both 
equations. These are the coordinates of the point where the two lines cross each other (here 
this is at about (4, -1)). 
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6 8 



Figure 2.C.1 



Does this mean that any two equations which give straight lines on a graph will have a 
solution which can be shown in this way? What might happen which would make this 
impossible? 



If the two lines have the same gradient so that they are parallel there will be no 
solutions which will fit both. (For example, there is no solution which fits 2x + 3y = 1 
and 2x + 3y = 5.) 

What happens if we have the two equations x - 2y = 6 and 2x - Ay = 12? 



We only actually have one piece of information here since the second equation is just the 
first one multiplied by 2, and so we have the same line drawn on top of itself. Every point 
on this line fits both equations and we therefore have an infinite number of possible 
answers. 

What happens if we have a third equation which we want to be true at the same time as 
the original pair? 

Geometrically, it is easy to see what happens. Either its line passes through the same 
crossing point as the other two, in which case it agrees with them or is consistent with them, 
but doesn't add any new information. Or its line does not pass through this crossing point 
at all. In this case, it is inconsistent with the other two equations, and the three equations 
cannot be simultaneously true. 



Although the graph method makes it easy to see what is happening, it can be very difficult 
to read off an accurate answer. A far simpler way to find this answer is to use algebra. There 
are various methods which can be used, and the best choice depends on the actual equations 
and comes with practice. I will show you two different ways of solving the pair of equations 
which were shown in Figure 2.C.1 above. 



2.C.(b) 



Methods of solving simultaneous equations 



2.C Relating equations to graphs 
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Method (A) Substitution. 

From equation (2), we have x = 2y + 6. 

We are looking for values of x and y so that both the equations are true 
together, so we can replace the Y in equation (1) by 2y + 6. We then have 

2(2y + 6) + 3y = 5 

so 4y + 12 + 3y = 5 

so ly = -7 

and y = -l. 

Now, substituting -1 for y in equation (2) we have 

x + 2 = 6 so x = 4. 

Checking in equation (1), LHS = 8-3 = 5 = RHS. 

I am again using the shorthand LHS for the left-hand side of an 
equation, and RHS for its right-hand side. 

Method (B) Elimination. Returning to the beginning, multiply equation (1) by 2 
and equation (2) by 3. Then we have 

f 4x + 6y = 10 (3) 

I 3x-6y = 18 (4) 

Adding equations (3) and (4) gives Ix = 28 so x = 4 and, by 
substitution, y = -1 as before. 

Method (B) could also have been done by multiplying equation (2) by -2. Then 

f 2x + 3y = 5 (3a) 

I -2x + 4y = -12 (4a) 
and adding equations (3a) and (4a) gives ly = -7 and y = -1 as before. 
Alternatively, you could multiply equation (2) by +2 and subtract. This gives 

f 2x + 3y = 5 (3b) 

I 2x - 4y = 12 (4b) 
Subtracting equation (4b) from (3b) gives ly = -7 and y = -I. 



9 



It is easier to make mistakes when subtracting negative quantities, so it is 
usually better to choose your numbers so that you can get rid of one of the 
letters by adding. 



It is likely, if a real-life situation is being modelled that we would have to solve more 
equations in more variables. If there is the same number of equations as there are variables, 
and provided we don't have a situation similar to the two equations being either parallel or 
just the same equation, as described above, then we can usually solve them by successive 
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elimination until just one variable is left. Once this is known, the other variables can be 
found in turn by substituting back into the equations. Such sets of equations, and their more 
complicated cousins in which the number of variables does not tally with the number of 
equations, can be dealt with more systematically by using matrix methods. 

Try solving these two pairs of simultaneous equations yourself before continuing. 



Qu(i) 



3x-2y = 21 (1) Qu(2) 
2x + 5y = -5 (2) 



x y 

3 2 

6x + y 



1-0 (1) 
8 = 0 (2) 



These are possible routes to solutions. 

For Qu(l), multiply equation (1) by 2 and equation (2) by -3. This gives 

6x - Ay = 42 (3) 

-6x-15y=15 (4) 

Equation (3) added to (4) gives -I9y = 51 so y = -3. 
Putting y = -3 in equation (1) gives 3x + 6 = 21 so x = 5. 
Now check in equation (2). LHS = 10 - 15 = -5 = RHS. 

In Qu(2), we start by getting rid of the fractions in equation (1) by multiplying by 6. Then 
we multiply equation (2) by 3. This gives us 

2x - 3y + 6 = 0 (3) 

18x + 3y + 24 = 0 (4) 

3 

Adding equations (3) and (4) gives 20x + 30 = 0 so 20x = -30 and x = -5. 
Putting this value in (2) gives -9 + y + $ = 0 so y=\. 
Checking in (1) gives LHS = -5-5+ 1 = 0 = RHS. 

Sometimes we can use these techniques in situations which at first sight don't look very 
promising. Here is an example. 



6 


2 


X 


y 


4 


3 


X 


y 



Our usual method is to get rid of fractions first. To do this, we would have to multiply 
equation (1) by 2xy and equation (2) by xy. Then we would have: 

I2y -4x = xy (3) 

4y - 3x = 0 (4) 

which looks rather unpleasant. 
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But if we put 

1 1 

X = - and 7 = - 

x y 

the original equations become 

f 6X-2Y=\ (3) 
I AX- 37= 0 (4) 

Then multiplying equation (3) by 2 and equation (4) by -3 gives 

f 12X-4Y=\ (5) 
[ -12X+ 97= 0 (6) 

Adding these two equations gives 57 = 1 so 7= \ and y = 5. Now (2) becomes 

4 3 4 3 20 

— - — = 0 so — = — so 20 = 3x and x = — . 
x 5 x 5 3 

Checking in (1) gives LHS = fj) - § = \ = RHS. 



EXERCISE 2.C.1 



Solve the following pairs of simultaneous equations. 
50 - 2b = 68 (l) f sp - 20 = 9 CO 



(i) 



(3) 



30 + b = 10 (2) 

x 5 
--/ = -- (1) 
8 2 



3* + - = 13 (2) 
3 



(2) 



(4) 



2p + 50 = -8 (2) 



- + - = 0 (1) 

x y 



2 2 

7 (2) 

x y 



2.D Quadratic equations and the graphs which show them 

Because quadratic equations have many applications, I have emphasised the particular 
aspects of them here which will help you later on. For this reason, I haven't started this 
section with a self-test. You will be able to check through quite quickly to see what is here, 
doing some of the exercises to be sure you understand. As usual, I am starting from scratch 
just in case some of you do need this basic help. 



2.D.(a) What do the graphs which show quadratic equations look like? 

So far, we have only looked at graphs of straight lines. These all have equations of the form 
y = mx + c where, as we have seen, m tells us the relative change in the y values for a given 
change in the x values, and c tells us where the line cuts the _y-axis. 
What effect will it have if we include an x z term as well? 
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We will look at y = x 2 - x - 6 as a first example and we start by making a table of some 
values below. 

y = x 2 - x - 6 



X 


-3 


-2 


-1 


0 


1 


2 


3 


4 


y 


6 


0 


-4 




-6 




0 





(Fill in the three missing ones yourself.) 



You should have -6, -4 and 6. 
If we plot these pairs of values we will get the graph I show in Figure 2.D.I. 




Figure 2.D.1 



Clearly, this is not a straight line. Because of the x 2 , the y values no longer change evenly 
in proportion to the x values. 

If we join the points smoothly, we get a curve. (We can justify doing this because working 
out intermediate values gives us more points which lie on the same curve.) This curve that 
we get is called a parabola. 

Factorising as we did in Section l.B.(c), we can also say that 

x 2 - x - 6 = (x - 3)(x + 2). 

Now, if y = 0 then x 2 - x - 6 = (x - 3)(x + 2) = 0. 

x 2 - x - 6 = 0 is an example of a quadratic equation. 

We can see from the graph that_y = 0 when x = 3 or x = -2. We also see that each of these 
values for x makes one of the brackets (x - 3) and (x + 2) equal to zero. 

If two numbers multiplied together give zero, then one of them must itself be zero. (There 
is no other number which behaves like this; we saw in Section 1 .E.(c) that there are infinitely 
many pairs of numbers which multiply together to give the number 1 , and the same is true 
for any other number but zero. Zero drops any number it multiplies into a black hole of 
zero.) 

We now use this special property of zero to find solutions for quadratic equations like 
x 2 - x - 6 = 0 directly by algebra, without having to draw a graph. 
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For example, suppose we have the equation x 2 - x + 12 = 0. 
Factorising, we get 

x 2 -x + 12 = (x-4)(x + 3) = 0. 
Therefore, either x - 4 = 0 giving x = 4, or x + 3 = 0 giving x = -3. 

^^^^ Notice that the signs of the solutions for x are the opposite of the signs in the 
^H-^V corresponding brackets. 

(If you need help with factorising, go back to Section l.B.(c) in Chapter 1.) 



exercise 2.D.1 Try solving these for yourself. 

(l) x 2 + gx + 14 = o (2) x 2 + 4* - 12 = o (3) x 2 - lix + 18 = 0 
(4) x 2 - x - 20 = o (5) 2x 2 + 13X + 6 = 0 (6) 3X 2 - jx - 6 = o 

Sometimes, with an equation involving x 2 , it is easy to write down the answers without 
factorising. For example, the equation x 2 = 16 can be solved simply by taking the square root 
of both sides. 

If x 2 = 16 then x = ±4. (The sign ± means 'plus or minus'.) 

^^^^ Don't forget the -4 which comes because (-4) 2 = 16 as well as (+4) 2 . Notice, 
^H-^V too, that you only need the ± one side; putting it both sides will just give you 
the same pair of answers twice over. 



So that you can see that we get the same answers, I will also show you how to solve this 
equation by factorising. 

We would say x 2 - 16 = 0 so (x - 4)(x + 4) = 0 so x = ±4. This factorising is another 
example of the difference of two squares. 

Now I shall take the slightly more complicated equation of (x + 2) 2 = 16 as a second 
example. 

Again, we square-root both sides. This gives us the following working: 
(x + 2) 2 = 16 so x + 2 = ±4 so x = 2 or x = -6. 
This is quicker than the working needed for factorising which goes 

(x + 2) 2 =16 so x 2 + 4x + 4 = 16 so x 2 + 4x - 12 = 0 
so (x - 2)(x + 6) = 0 so x = 2 or x = -6. 

exercise 2.D.2 Solve the following equations yourself. 

(1) x 2 = 9 (2) x 2 = % (3) (x - 3 ) 2 = 4 

(4) (2x - 3 ) 2 = 25 (5) (3* - 2) 2 = 36 
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2.D.(b) The method of completing the square 

There is another way of finding the solutions for quadratic equations which is called 
completing the square. This method may seem clumsy at first, but it is worth persevering 
with it because it has other very useful applications. In particular, we shall use it to handle 
the equations of circles in Section 4.C.(d), Section 8.F.(a) and Section lO.E.(c). We shall 
also use it in Section 9.B.(d) to help us with integration, and in Section 2.D.(d) to show 
how we get the 'formula' for quadratic equations. Finally, we shall need it in the next 
section to help us to sketch graphs, so altogether we see that it will be worth the effort 
we put into it. 

The following example shows how it works. 

Suppose we have the equation x 2 + 6x - 16 = 0. Then either we can say 

x 2 + 6x - 16 = (x + 8)(x - 2) = 0 so x = -8 or x = 2, 

which is the method that we have been using so far, or we can rearrange the equation so that 
it looks like the equation (x + 2) 2 = 16 which we solved in the previous section. 
We do this as follows. We have 

x 2 + 6x - 16 = 0 so x 2 + 6x = 16. 

Now we say that x 2 + 6x could have come from (x + 3) 2 except that (x + 3) 2 gives us an 
extra term of 9 since (x + 3) 2 = x 2 + 6x + 9. 

So, taking account of this, we can replace x 2 + 6x by (x + 3) 2 - 9. We have written 
x 2 + 6x by completing a square and then taking off the extra +9 which this has given 
us. 

The equation now becomes 

(x + 3) 2 - 9 = 16 so (x + 3) 2 = 25. 

Square-rooting both sides, as we did in the last section, we have x + 3= ±5 so x = 2 or 

x = -8. 

Here is a second example in which I have shown the working more briefly. 
I will solve the equation x 2 - 2x - 3 = 0 by completing the square. 

x 2 - 2x - 3 = 0 so x 2 - 2x = 3 but x 2 - 2x = (x - l) 2 - 1 
Therefore we have 

(x - l) 2 - 1 = 3 so (x - l) 2 = 4. 
Square-rooting both sides gives us 

x - 1 = ±2 so x = 3 or x = -l. 

We see from this and the previous example that all we have to do to get the correct bracket 
for completing the square is to halve the coefficient of x. In the first example, we halved 6 
to get 3, and in the second we halved -2 to get -1. 

We must also remember to take off the extra bit which we have added on by squaring the 
bracket. These were 3 2 = 9 in the first example, and l 2 = 1 in the second. 

exercise 2.D.3 Now try solving these three quadratic equations yourself by completing the 

square. 

(l) x 2 + 4x = 2i (2) x 2 - 6x + 8 = o (3) x 2 - 3X - 10 = o 
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2.D.(c) Sketching the curves which give quadratic equations 

The method of completing the square gives us a neat way of sketching the curves connected 
with quadratic equations. We shall now look at how this is done by taking y = x 2 - 2x - 3 
as an example. 

We can rewrite x z - 2x - 3 as 

(x - l) 2 - 1 - 3 or (x - l) 2 - 4. 

Using this rewritten form of y = (x- l) 2 - 4, what is the smallest possible value which y can 
take, and what value of x makes this happen? 



Since we can't get a negative result when we square a number, the smallest possible value 
of (x - l) 2 is zero, and this happens when x = 1. So the smallest possible value of y is -4 
and the lowest point on the curve of y = x 2 - 2x - 3 has the coordinates (1, -4). 

As the values taken by x move further and further away either side from x = 1 , the value 
of y becomes increasingly large since the value of x 2 becomes increasingly large. (It very 
soon swamps out the effect of the -2x - 3.) If you are unsure about this behaviour of y, test 
it for yourself using your calculator by choosing pairs of values of x symmetrically placed 
either side of x = 1. The further away you go, the larger the value of y becomes. 

We can also use two other pieces of information to help us to draw the sketch of 
y = x 2 - 2x - 3. 

The first is the value of the y-intercept, that is, the place where the curve crosses the 
j-axis. For this curve, this is (0, -3), since y = -3 when x = 0. 

The second is the values of x for which y = 0. These are called the roots of the equation 
y = 0. Here, putting 

y = (x - l) 2 - 4 = 0 

gives 

(x - l) 2 = 4 

so x - 1 = ±2 giving x = 3 or x = -l. 

We can now draw a sketch of the parabola y = x 2 - 2x - 3 using all the information which 
we have found above. I show this in Figure 2.D.2. 




Figure 2.D.2 
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^^^^ The roots are the values of x which are the solutions of the equation 
^B-^V a- 2 - 2x -3 = 0. It is very important to remember to write this as an 

equation by including the '= 0'. The expression x 2 - 2x - 3 on its own can 
have infinitely many values, some of which are shown by the y values in the 
graph sketch of y = x 2 - 2x - 3 shown above. 

Notice that all the important information is clearly labelled on the graph. 



What will happen if we have to sketch a graph which starts off with -x 2 ? 

For instance, what happens if we sketch y = -x 2 + 2x + 3 (the same as the one which we have 
just done, but with all the signs changed? Try doing this for yourself before reading on.) 



The whole curve is simply turned upside down, because each positive value for y is 
changed to the corresponding negative value, and vice versa. 

The roots of x = -1 and x = 3 are still the same, but now the highest point is given by 
(1, +4), and the ^-intercept is (0, 3). 

If you weren't able to sketch it before reading this, sketch it on top of my graph of 
y = x 2 - 2x - 3 now. 

Whenever we have an equation for y which starts with a negative quantity of x 2 , we will 
get an upside-down or inverted U-shaped curve like this one. (The negative changes the 
smiley parabola into a sad parabola.) 

exercise 2.D.4 Try using the same techniques to sketch the following two pairs of graphs. 

(1) (a) y = x 2 - 4* + 3 (b) y = -x 2 + 4X - 3 

(2) (a) y = x 2 + 2x - 8 (b) y = -x 2 - ix + 8 

(The general rules for sketching curves like this are given at the end of Section 
2.D.(f) as they also involve results which come from the formula for solving 
quadratic equations.) 

2.D.(d) The 'formula' for quadratic equations 

So far, all the quadratic equations we have looked at have turned out to have roots which are 
either whole numbers or fractions. Surely this will not always be true? The square roots of 
most numbers cannot be written as exact fractions or whole numbers. (In Section l.E.(d) we 
showed that v 2 can't be written in this way.) 

Also, how can we tell if the curve of a particular equation never actually crosses the 
x-axis without drawing it? 

It will be much easier for us to answer these questions if we can find a general rule for 
solving quadratic equations. Then we shall be able to see exactly what makes particular 
problems arise. 

We start with ax 2 + bx + c = 0 with a, b and c standing for numbers and a * 0. 
We want to find a formula from this which will give us a rule for finding the possible 
values of x if we know the values of the numbers a, b and c. 

First, we divide through by a as it is easier then to complete the square. Then we have 

b c b c 

x H — x H — = 0 so x H — x = - — . 
a a a a 
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Now we complete the square, halving the coefficient of x, and taking off the square of this 
amount just like we did in the numerical examples in Section 2.D.(b). This gives us 

2a I \2a ) a \ 2a j \ 2a j a 

b \ 2 b 2 c ( by b 2 - 4ac 

so | x + — = — so [x + — = . 

2a I 4a 2 a \ 2a) 4a 2 

Now, taking the square root of both sides, we get 




b 2 -4ac ±^b 2 - 4ac 
4a 2 2a 



b ijb 2 - 4ac 

so x = - — ± . 

2a 2a 

Finally, we get 



-b ± ^b 2 - 4ac 

x = . 

2a 

This is the so-called 'formula' for solving quadratic equations. 



If you have seen this before, you may have realised that the right-hand side of the above 
working was growing more and more familiar. 

All we have to do to make use of it is to substitute the values of a, b and c from the 
particular equation that we want to solve. 

For example, to solve 2x 2 - 5x + 1 = 0 we put a = 2, b = -5 and c = 1. Then 

+5 ± J25 - 4(2)(1) 5 ± Jll 

x = = = 2.28 or 0.22 to 2 d.p. 

4 4 

Because v 17 is irrational, that is, it has no exact square root, it would not have been possible 
to factorise this equation in any simple way. 

Even equations which can be solved by factorising are often more easily dealt with by 
using the formula, if the factorisation is at all difficult. 

For example, the equation 12x 2 + I9x -18 = 0 will factorise into brackets with whole 
number coefficients. We know that this is possible from working out the value of 'b 2 - 4ac\ 
Here b = 19, a = 12 and c = -18, so b 2 - 4ac = 1225 = (35) 2 . (The number 1225 is called 
a perfect square because it has an exact square root.) 

In fact, 1 2x 2 + 1 9x - 1 8 = (4x + 9) (3x - 2) but these brackets may not spring immediately 
into your head. Substitution into the formula gives 

-19 ± 35 9 2 

x = = - — or — 

24 4 3 

just as we would obtain from the factorised form. So the equation 12x 2 + 19x -18 = 0 has 
the two roots or solutions of - 4 and 3. 
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exercise 2.D.5 Use the formula to solve the following quadratic equations. (If the answers are not 

exact fractions, give them correct to 2 d.p.) 

(1) x 2 + iox + 16 = o (2) x 2 - 2x - 8 = 0 (3) 2x 2 + 5X - 3 = 0 
(4) x 2 + 4X + 2 = o (5) 3X 2 - x - 2 = o (6) 2X 2 - x - 7 = o 




You should try this now as you will need your answers for the next section. 

(a) For each equation which you have just solved, find what you get if you 
add the two solutions or roots together. Can you connect this answer 
with the a, b and c of the particular equation in any way? 

(b) Now find what you get if you multiply each of the pairs of roots 
together. Then again see if you can connect the results with the a, b and 
c of the particular equation. If your answers aren't exact fractions or 
whole numbers, you will find that the more decimal places you take, the 
closer you will get to a nice result, because you will be lessening the 
rounding errors. 

(c) Now for the tricky bit. Can you see why you are getting these neat 
results from adding and multiplying the pairs of roots even when the 
roots themselves are not simple numbers? Try looking at how your 
working went when you used the formula to get your two answers. 



2.D.(e) Special properties of the roots of quadratic equations 

This section is based on your answers to the thinking point at the end of the previous 
section. 

When you add the pairs of roots for each of the equations in Exercise 2.D.5, you should 
find each time that you get the answer of -bla for that equation. 

For example, in question (3), the two roots are 5 and -3, and a = 2, b = 5 and c = -3. 
Adding the roots gives \ - 3 = -l\ = -§. 

We can see exactly why this should be so by looking at the roots of the equation 
ax 2 + bx + c = 0. These are 

-b + ^Jb 2 - 4ac -b - ^Jb 2 - 4ac 

and . 

2a 2a 

Splitting each of them into two parts and adding them gives 

l-b V* 2 -4ac\f-b ^b 2 -4ac\ -b -b b 

\2a 2a ) \2a 2a ) 2a 2a a 

The two complicated bits have cancelled out. 

When you multiply the pairs of roots for each of the equations in Exercise 2.D.5, you 

should find that you get the answer of +c/a for that equation. (For example, in question (3) 

1 3 

you get 2 X -3 = -5. The minus agrees with c being negative here.) 

We can see why this happens if we multiply the two roots of ax 2 + bx + c = 0 together, 
though it's a bit more complicated this time. We have 

l-b ^b 2 -4ac \ l-b ^b 2 - 4ac \ _ l-b\ 2 l^jb 2 -4ac \ 2 

\2a 2a ) \ 2a 2a / \2a / \ 2a } 
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The two middle bits have cancelled out, because of the + and - signs. This is the difference 
of two squares of Section l.B.(b) again. Tidying up gives us 

b 2 (b 2 - 4ac) 4ac c 

4a 2 4a 2 4a 2 a 

When we either add or multiply any pair of roots, we get rid of the square root of the number 
b 2 - 4ac. We therefore also get rid of any complications which might arise from trying to 
find this square root. 



Two special properties of the quadratic equation ax 2 + bx + c = 0 

• Adding its two roots together gives -bla. 
This is called the sum of the roots. 

• Multiplying its two roots together gives da. 
This is called the product of the roots. 



We shall also get this same pair of results by following a different route in Section 
2.D.(h). 



EXERCISE 2.D.6 



This is an exercise of mixed questions on solving quadratic equations. If the 
answers to any question are not exact, give them correct to three decimal places. 

(i) Solve these in whatever way seems suitable. 

(a) 2x 2 + jx + 3 = o (b) 3x 2 + 4x + i = o (c) 2x 2 + X-4 = o 

(d) 6x 2 -jx+ 2 = 0 (e) x 2 -5x + 3 = 0 (f) 6x 2 + 5X- 6 = o 

(g)x 2 -8i = o (h) 6x 2 -x-i2 = 0 (i)x 2 -2 = o (j) x 2 -sx = o 

Check that the sum and product of the roots of each equation do fit the results 
given in the box above. 



(2) Solve the following equations. 



(a) 



2X - 3 



X - 1 



2X + 3 X + 1 



(b) 



y + 1 y - 1 



3 2X + L 

- (0 

y x + 1 



x- 8 



2X - 1 



2.D.(f) Getting useful information from 'b 2 - i^ac' 

From the quadratic equations which we have solved and the work of the last section, we have 
seen that it is having to find the square root of b 2 - 4ac which can make us sometimes get 
complicated answers. 

The b 2 - 4ac in the quadratic equation formula works as a kind of litmus paper or probe 
to tell us what kind of roots any particular equation will have. 
We look now at the different possibilities. 

(1) If b 2 - 4ac is positive then the equation will have two distinct roots. Geometrically, 
the curve of y = ax 2 + bx + c cuts the x-axis in two separate places. 

If b 2 - 4ac has an exact square root, then the two roots will be either whole 
numbers or fractions. This means that it must be possible to solve the equation by 
factorising and so gives a good quick test for this. 
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(2) If b 2 - 4ac = 0 then the two roots will come together as one root. For example, 

6 ± V36-36 

if we have x - 6x + 9 = 0 then x = = 3. 

2 

Also 

x 2 - 6x + 9 = (x - 3)(x - 3) = (x - 3) 2 . 

It is as though we have the root of 3 repeated twice. Geometrically, this is 
because y = (x - 3) 2 just touches the x-axis when x = 3. (See Figure 2.D.3.) The 
usual two roots have met up together to make just one root. 

Two roots One repeated root No roots 




Figure 2.D.3 

We shall use this property geometrically in Section 4.C.(e). 

(3) If b 2 - 4ac is negative, we cannot find a square root for it. The curve of the equation 
does not cut the x-axis at all. It is either completely above or completely below it so 
there are no values of x on the x-axis which fit the equation y = ax 2 + bx + c = 0. 

For some purposes, this lack of roots is not very satisfactory, and we cleverly 
get round it in Chapter 10 by inventing a new sort of number. 



A summary of everything that we now know which will help us to sketch 
curves of the form y = ax 2 + bx + c 

• If a is positive, the curve is U-shaped. 

If a is negative, the curve is an upside-down U. 

• The value of c tells us the _y-intercept. 
The curve crosses the j-axis at (0,c). 

• We can factorise (or use the formula) to find whether and where the curve cuts 
the x-axis. 

If b 2 - Aac is negative, the curve does not cut the x-axis at all. 

• We can complete the square to find where the least value of the curve is (or the 
greatest value, if it is an inverted U-shape). We shall see in Section 8.E.(b) that 
this can also be found by using calculus. 

If the curve does cut the x-axis, substituting the midway value of x between 
the cuts into the equation for y gives the least value of y (or the greatest value 
of y if the curve has an inverted U-shape). 
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exercise 2.D.7 Each of the six sketches shown below in Figure 2.D.4 comes from one of the ten 

curves whose equations are given. Fit each sketch to its correct equation, and then 

draw your own sketches for the four equations which are left over. 

(1) y = x 2 + 6x + 5 (2) y = x 2 - 6x + 5 (3) y = x 2 (4) y = -x 2 

(5) y = x 2 - 4X + 4 (6) y = i\x - x 2 - 4 (7) y = x 2 - 8x + 16 

(8) y = x 2 + 1 (9) y = x 2 - 3X - 4 (10) y = 3X + 4 - x 2 




Figure 2.D.4 



2.D.(g) A practical example of using quadratic equations 

s = ut — 5 g* 2 is a formula which gives the distance s in metres travelled by a ball from the 
thrower's hands if it is thrown upwards with an initial velocity of u m s" 1 (metres per second), 
after a time of t seconds, g is the acceleration due to gravity and is 9.8 m s~ 2 (metres per 
second per second) to 1 d.p. 

We shall now use this formula to answer the following questions. 

(1) If a rubber ball is thrown upwards at 14 m s~\ how high has it gone after 1 second? 

(2) How long does it take for the ball to reach a height of (a) 5 m, (b) 10 m, (c) 15 m 
from the thrower's hands? 

(3) Using the information you have now found, draw a sketch showing the relation 
between s and t. 

(4) How long does the ball take to fall back into the thrower's hands, which we will 
assume are ready and waiting? 

(5) Where is the ball after 2.9 seconds? 



(1) Using s = ut- \gt 2 , we have u = 14, t = 1 and g = 9.8 so s = 14 - (9.8/2) = 9.1; 
the ball has reached a height of 9.1 metres after 1 second. 

(2) (a) Putting s = 5, we have 5 = 14* - (9.8/2)f 2 so 4.9t 2 - Ut + 5 = 0. Solving this 

using the formula for quadratic equations gives 

14 ± ^196 - 98 14 ± V98" 
9.8 9.8 
which gives t = either 2.4 or 0.4 to 1 d.p. 
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(b) Putting s = 10 gives 10 = 14? - 4.9t 2 so 4.9? 2 - 14? +10 = 0. 
Again using the formula, we get 



14 ± V196 - 196 

? = = 1.4 to 1 d.p. or 1.43 to 2 d.p. 

9.8 

(c) Putting 5=15 gives 15 = 14? - 4.9? 2 so 4.9? 2 - 14? +15 = 0. 
Using the formula gives 



14 ± ^196-294 14 ± 4~9% 

9.8 9.8 

Because we have a negative square root here, it is impossible to find any value 
of ? on the horizontal ? axis which fits this equation. 

What is the physical meaning of the three answers we have found for question (2)1 

• Why are there two possible times to reach a height of 5 metres? 

• Why is there just one time to reach a height of 1 0 metres? 

• Why couldn't we find a time to reach a height of 1 5 metres? 

Try answering each of these questions yourself. 



The ball reaches a height of 5 metres from the thrower's hands both on the way up and 
on the way down, so there are two possible answers for the time. 

The single answer for the time taken to reach 10 metres means that this was the highest 
point the ball reached. So it never reached a height of 1 5 metres and it was impossible to find 
a time for this. 

The mathematics of the quadratic equations has exactly corresponded back to the 
physical situation. 

(3) With this information we can now draw a sketch of the relation between s and t. 
I show this below in Figure 2.D.5. 
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Notice that the graph sketch shows the height of the ball after time t. The little 
sketch at the side shows the actual path of the ball which is straight up and then 
straight back down. 

(4) Because the curves giving quadratic equations are symmetrical, if we know that the 
time taken for the ball to reach its highest point is 1 .4 seconds, then the time taken 
for it to fall back into the thrower's hands will be 2.8 seconds. 

(5) Clearly, from the sketch, after 2.9 seconds the ball should have been safely caught. 
If we put t = 2.9 in s = ut - j gt , we get s = -0.6 to 1 d.p. This describes what has 
happened to the ball if the thrower completely misses it and it just carries on 
downwards. It will be 0.6 metres below the thrower's hands after 2.9 seconds. 

Now see if you can answer this question. 

What is the meaning of the quadratic equation 0 = ut - \ gt 2 ? 



Solving this equation tells us when the ball is in the thrower's hands, that is, when 5 = 0. 
Factorising, we have 

0 = Ut - J gt = t(U - 2 gt) 

so either t = 0 (the ball is just about to be thrown up) or u - j gt = 0 so t = 2 u/g which is 
the time taken for the ball to return to the thrower's hands. When u = 14, t = 2.86 = 2 X 1.43 
seconds. Strictly speaking, the time of 2.8 seconds is an underestimate. 

The above working has ignored air resistance. It describes the motion of a rubber ball 
quite well but would be of no use to describe the motion of a feather. We are using the 
formula s = ut -j gt 2 as a mathematical model we can work with and which approximates 
quite well to the actual physical situation. 



If the ball is thrown up at 14 m s _1 we know that s = 14? - j gt 2 . 
Therefore we know the ball's height at any time during the throw. 
Surely, if we know this, we ought to be able to find out how fast it is 
moving at any particular time? 

See if you can answer these questions. 

(1) When does the ball move fastest? 

(2) When does it move slowest? 

(3) Can you estimate how fast it is going one second after it has been 
thrown up? 

(These questions will be answered in Section 8. A. (a) later on but it would be 
very good for you to think about the possibilities yourself here.) 



2.D.(h) All equations are equal - but are some more equal than others? 

In the last section, we looked at some of the physical meanings which equations can hold. 
We will end this chapter by spending some time examining the equations themselves. 

Do equations always work in the same kind of way, so that by solving them we find some 
specific answers which fit these particular circumstances? 
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Or, if not, what else can happen? 

The following examples all look straightforward at first sight, but try solving each of 
them yourself. Things are not always quite as they seem. 

(1) x 2 + 5x + 6 =x 2 +x-2 (2) x 2 -x - 6 = x 2 + 3x - 4 

(3) 2x 2 - 8x + 8 = x 2 - Ax + 5 (4) x 2 - 6x + 8 = (x - 2) (x - 4). 

It will help you to see what is happening if you also sketch the graph of each side of each 
equation. Then you can see whether, and if so where, these graphs cross. 
You should try doing this for yourself before looking at my solutions. 



(1) x 2 + 5x + 6 = x 2 + x - 2 so 4x = -8 and x = -2. 

To show this single solution graphically, we sketch, using the same axes, 
(a) y = x 2 + 5x + 6 = (x + 3)(x + 2) and (b) y = x 2 + x - 2 = (x + 2)(x - 1). 
The sketch in Figure 2.D.6 shows that y = 0 for both (a) and (b) when x = -2. 




Figure 2.D.6 



(2) x 2 - x - 6 = x 2 + 3x - 4 so -2 = 4x and x = -\. 

The sketch in Figure 2.D.7 of (a) y = x 2 - x - 6 = (x - 3) (x + 2) and 

(b) y = x 2 + 3x - 4 = (x - 1) (x + 4) shows that there is the single solution of 

x = -\ which gives equal y values for both (a) and (b). 




2.D Quadratic equations and their graphs 



73 



(3) 2x 2 - 8x + 8 = x 2 - 4x + 5 so x 2 - 4x + 3 = 0 so (x - 3) (x - 1) = 0 and x = 3 or x = 1 . 
The sketch in Figure 2.D.8 of (a) y = 2x 2 - 8x + 8 = 2(x 2 - 4x + 4) = 2(x - 2) 2 and 
(b) y = x 2 - 4x + 5 = (x - 2) 2 - 4 + 5 = (x - 2) 2 + 1 shows the two possible values of x 
which make the y values of (a) and (b) the same. These are x = 1 and x = 3. 




Figure 2.D.8 

(4) x 2 -6x + 8 = (x-2)(x-4) 

Multiplying out the right-hand side gives exactly the same expression as the left- 
hand side. Therefore, any value of x is a possible solution since it will make each 
side of (4) have the same value. The two graphs lie on top of each other - they are 
the same graph. I show this in Figure 2.D.9. 

GUI points on thts curve 




Figure 2.D.9 

What we have here is not an ordinary equation but just two different ways of writing the 
same piece of information. The two sides are identically equal to each other (rather like 
identical twins). We call an equation like this an identity. 

Just like identical twins, the two sides are equal in every detail, so there are the same 
number of x 2 terms on both sides of the '=' sign, and the same number of xs. The number 
terms on each side are also equal. This is the only way that the two sides can remain equal 
to each other for all possible values of x. 

Remembering that the number which tells you how many you have of x 2 , say, is called 
its coefficient, we see that comparing the coefficients will give us three equal pairs of 
values. 
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If two expressions are identically equal to each other, the coefficients of each separate 
power of x on each side of the ' = ' sign must be the same as each other. 



This rule gives us a very neat method of finding out how to write expressions in different 
ways. We'll use it in the next section to factorise expressions which involve terms with x 3 , 
and then later on in Section lO.D.(c) to find complex roots of equations. Also, we'll see in 
Section 6.E.(d) that it will make finding some kinds of partial fraction much easier. 

I'll now finish this section by showing you how to use this rule to find the special 
properties of the sum and product of the roots of quadratic equations. We have already found 
these properties in Section 2.D.(e) by working directly from the roots themselves, but this 
new method will avoid the tricky algebra which we had to use there. 

Suppose that the equation ax 2 + bx + c = 0 has the two solutions x = a and x = so that 
its two roots are a and (a and j3 are the Greek letters for a and b and are called alpha and 
beta. They are very often used to stand for the roots of quadratic equations.) 

We start by dividing both sides of the equation ax 2 + bx + c = 0 by a. This gives us 

x 2 + -x + - = 0. 
a a 

(We do this division because it will simplify the working which follows.) 
Now, (x - a) (x - p) = 0 is just another way of writing 

x 2 + (b/a)x + da = 0. 

Also, 

(x - a) (x - /?) = x 2 - ax - fix + afi = x 2 - {a + /?) x + a/3 

so y = x 2 - (a + j3) x + a/3 gives exactly the same curve as y = x 2 + (b/a)x + da. 

(The earlier division by a means that we now have two curves which are identical for 
every value of x. You can see exactly how this works if you take the numerical example of 
2x 2 -6x + 4=x 2 -3x + 2 = 0 which has the two roots x = 1 and x = 2.) 

We already have matching terms of x 2 on both sides. 

Comparing the coefficients of x (which must also be equal), we have 

b b 
-(a +{$) = - so a+P = --. 

a a 

Also, comparing the two number terms, we have aft = da. 
This gives us the following two rules. 



If we have the quadratic equation ax 2 + bx + c = 0, 

then the sum of its roots = - bla and the product of its roots = da. 



A note on writing identities 

The special form of equality called an identity in maths, where the two sides of the 
expression remain equal for all possible values of x, is sometimes written using the triple 
equals sign ' = '. You can think of the sign ' = ' as meaning 'is the same as' or 'is equivalent 
to'. Mathematicians often speak of the two sides as being identically equal to each other. 
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2.E Further equations - the Remainder and Factor Theorems 

2.E.(a) Cubic expressions and equations 

How could we set about solving an equation like 2x 3 - 5x 2 - 6x + 9 = 0? 

This is called a cubic equation since the highest power of x is x 3 . There isn't a very simple 
formula for solving cubic equations, so we see if we can successfully guess one answer to 
start us off. (The following method will only work for equations which have exact solutions 
which are also not too hard to guess; if this is not the case, other methods involving closer 
and closer approximations to the true solutions would have to be used.) 

Here, if we try putting x = 1, we get 2x 3 - 5x 2 - 6x + 9 = 2- 5- 6 + 9 = 0, so we 
immediately have one solution of our equation. 

It will make the working much shorter and easier to follow if we now introduce a 
shorthand way of describing 2x 3 - 5x 2 - 6x + 9. We will call it f(x), with the name f(x) 
meaning this particular collection of terms whose value changes as x changes. 

This gives us a neat way of showing particular values of fix) associated with their 
corresponding values of x. 

For example, if x = 2 we have/(2) = 2(2 3 ) - 5(2 2 ) - 6(2) + 9 = -7 so f(2) = - 7. 

(In fact, f{x) is what is called a function of x. In Section 3.B, we shall look at what 
functions are in more detail.) 

We can now say that fix) = 2x 3 - 5x 2 - 6x + 9 and we know that f{\) = 0. 

Since x = 1 is a solution or root of this equation, it seems reasonable to think that (x - 1) 
must be a factor of fix), just as we found with quadratic equations. 

(We will show that it is all right to say this in Section 2.E.(c).) 

If (jc - 1) is a factor, we can say that 

fix) = 2x 3 - 5x 2 - 6x + 9 = ix - 1) (something). 

Since the right-hand side is just another way of writing the left-hand side, the two sides must 
be exactly the same as each other. Therefore we must have the same matching quantities of 
x 3 , x 2 , x and numbers on both sides. This means that it is easy to match up the two end terms 
in the right-hand bracket. It is just the middle one which will take a bit more thought. We 
can say 

2x 3 - 5x 2 - 6x + 9 = (x - l)(2x 2 + px - 9) 

where p is standing for the number which we haven't found yet. 

Now, matching the terms in x 2 , we have -5x 2 = -2x 2 + px 2 , picking out the ways in which 
we can get x 2 on the right-hand side. 

Therefore, -5 = -2 + p so p = -3. 

We can check that this is correct by matching the terms in x. 

Doing this gives us -6x = -px - 9x which does indeed work for p = -3. 

So now we can say 2x 3 - 5x 2 - 6x + 9 = (x - l)(2x 2 - 3x - 9). 

What we have here is an example of an identity, like the ones which we described in 
Section 2.D.(h) where we also matched up terms in this way. 

We can find the other two solutions or roots of the equation /(x) = 0 by solving 
2x 2 - 3x - 9 = 0. Factorising, 

2x 2 - 3x - 9 = (2x + 3)(x - 3) = 0 

so x = 3 or -j. 

The three solutions or roots of fix) = 2x 3 - 5x 2 - 6x + 9 = 0 are therefore given by x = 1 , 
x = 3 and x = -5. 
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What will the graph of y = f(x) = 2x 3 - 5x 2 - 6x + 9 look like? 

3 

We know that it must cut the x-axis three times, at x = 1 , x = 3 and x = -j- 

It also seems reasonable to say that, if we find enough values of y from feeding in values 
for x into f(x), the graph would be able to be drawn in one continuous line. 

If we putx = 0, we get f(x) = 9, so we know that the curve cuts thej-axis at the point (0,9). 

If x is large and positive, which has the most powerful effect: the 2x 3 or the -5x 2 ? Try 
putting x = 2, x = 10 and x = 100. You will see that, as x gets larger, the 2x 3 term swamps 
out the -5x 2 term. So y will also become large and positive. 

In just the same way, if x is large and negative, 2x 3 will also be large and negative, and 
so y also is large and negative. 

We now know enough to make a sketch of the graph of f(x) and I show this below in 
Figure. 2.E.I. 



"3 




Figure 2.E.1 f(x) = 2x 3 - 5X 2 - 6x + 9 

This is the best that we can do at the moment. With straight lines, we could also use the 
steepness or gradient to help us with the graph sketch. With quadratic graphs, we were able 
to complete the square to find the least (or greatest) value of the graph. You might perhaps 

feel that, since we can find the value of y for any value of x here, surely we ought to be able 

3 

to find out a bit more about the size of the greatest value coming between x = -5 and x = 
1, and the least value coming between x = 1 and x = 3. We can't discover these values yet, 
except approximately by trying lots of values of x, but we shall find out how it is possible 
to do it in Section 8.E.(b). 

example (1) We will now solve the equation f(x) = 3x 3 + 2x 2 - 12x - 8 and use the 
roots to sketch the graph of y = f(x). 

(f(x) is now referring to the new collection of terms of 
3x 3 + 2x 2 - 12x - 8. We could also have used some other letter, calling 
it, say, g(x) or h(x) if we had wished.) 

First, we hope to find a root of f(x) = 0. Can you find one? 



This time, if we try x = 1, we get /(l) = 3 + 2- 12 - 8^0 so x=l 
is not a solution of f(x) = 0. 

Puttingx = 2gives /(2) = 3 X 8 + 2 X 4- 12 X 2-8 = 0 so x = 2 
is a root. This means that (x - 2) is a factor of f(x). We can now say 

f(x) = 3x 3 + 2x 2 - 12x - 8 = (x - 2)(3x 2 + px + 4) 

matching up the two end terms in the right-hand bracket and letting p 
stand for the number which we still have to find. 

Matching up the terms in x 2 , we have 2x 2 = -6x 2 + px 1 so p = 8. 
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Checking with the terms in x, we have -12x = -2px + 4x so p = 8 is 
correct. 

(It is also possible to find the second bracket here of (3x 2 + 8x + 4) 
by long division of (x - 2) into 3x 3 + 2x 2 - \2x - 8, but I think the 
method above is easier. I shall show you how to do long division in 
algebra in the next section.) 

We now have 

f(x) = (x- 2)(3x 2 + 8x + 4) = (x - 2)(3x + 2)(x + 2) 
factorising the second bracket, and the equation fix) = 0 has the three 

2 

solutions or roots: x = 2 or x = -5 or x = -2. 

We will now use these three roots to help us to sketch the graph of 

y Jin 

Putting x = 0 gives us /(0) = -8, so the curve of y =f(x) cuts the 
j-axis at the point (0, -8). 

f(x) will behave in a similar way to the first example when x takes 
very large positive or negative values, so we now use all the information 
we have to draw the sketch in Figure 2.E.2. 




Figure 2.E.2 f(x) = 3X 3 + 2X 2 - 12X - 8 



exercise 2.E.1 For each of the following, first find the roots of fix) = 0 and then use these to 

help you to sketch the graphs of y = fix) in each case. For each graph, you will 
also need to find out where it cuts the y-axis, and how fix) behaves when x takes 
either very large positive values or very large negative values. 

(1) y = fix) = 3X 3 + 2x 2 - 3* - 2 (2) y = fix) = 2 + 3X - 3X 2 - 2x 3 
(3) y = fix) = 4* 3 - i5* 2 + 12X + 4 (4) / = f{x) = x 3 -3X 2 + 3X-1 

We could use exactly the same method to solve equations which start with a term in x 4 . 
The only problem is that it depends upon being able to guess some roots correctly to start 
with. Often, none of the roots of fix) = 0 will be simple whole numbers, and indeed they may 
not even be real numbers, as we have already found with some quadratic equations. If this 
happens, the graph sketches will no longer look like the ones we have drawn, though in the 
case of a cubic graph it will have to cross the x-axis at least once, because the y values must 
go from large negative to large positive or vice versa, and the graph itself is a continuous 
line. So a cubic equation will always have at least one real root (that is, a root which can be 
found on the x-axis). 

Also, once we have got beyond quadratic equations, general formulas for finding the 
roots are either far more complicated or do not exist at all. It is, however, possible to use 
numerical methods for solving such equations by approximating to the roots with any 
desired degree of accuracy. 
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2.E.(b) Doing long division in algebra 

Usually long division in algebra can be avoided (as we did in the last section when we used 
the method of matching up the terms on the two sides for factorising), but sometimes this 
isn't possible, so we will now look at how this process works. 

2x 3 + 9x 2 - 3x - 20 
We will take as a first example 

x + 3 

We will have: 

X x* - + 3x - 1 2 

Xx* + 6x* 

SxN-Sx 

- I2x-20 

- nx-36 

Figure 2.E.3 

The working for the division is set out as I have shown in Figure 2.E.3. 
x + 3 is called the divisor and 2x 3 + 9x 2 - 3x - 20 is called the dividend. 
The process consists of the following. 

• Divide the highest power by the highest power in the divisor. 

Here, divide 2x 3 by x, which gives us 2x 2 . 

• Multiply the divisor by this quantity. 

Here, we multiply x + 3 by 2x 2 to get 2x 3 + 6x 2 . 

• Subtract. This gives us the mismatch at each stage. 

Here, we get 3x 2 . 

• Bring down the next term in the quantity being divided to the working level. 
Here, we now get 3x 2 - 3x. 

• Repeat the process until the highest power of x in the divisor is greater than the 
highest power of x it would be divided into. 

What is then left is called the remainder, and the result of the division is called the 
quotient. 

Here we have the result 

2x 3 + 9x 2 - 3x - 20 16 

= 2x 2 + 3x - 12 + . 

x + 3 x + 3 

The quotient is 2x 2 + 3x - 12 and the remainder is +16. 

Compare this with the numerical example 

187 7 

= 12 + — . 

15 15 

We see that 15 goes 12 times into 187 with a remainder of 7. 



2.E The Remainder and Factor Theorems 



79 



Here is another example of long division, this time with no remainder. 

If (x - 3) is a factor of 2x 3 - 9x 2 + Ix + 6 then it must divide into it exactly, (just as 3 
is a factor of 12 and divides into it exactly four times). 

We will now prove that (x - 3) is a factor of 2x 3 - 9x 2 + Ix + 6 by using long division. 
The working is shown in Figure 2.E.4. 



Figure 2.E.4 



In practice, it is almost always possible to avoid long division if you do not take kindly 
to it; we managed to do this when we were doing the factorising earlier, and there are other 
ingenious methods which can be used which I will show you as you need them. 



2.E.(c) Avoiding long division - the Remainder and Factor Theorems 

In Section 2.E.(a), we found that if f(x) = 2x 3 - 5x 2 - 6x + 9 then f(l) = 0. 

It is certainly true that if (x - 1) is a factor of f(x) then putting x = 1 will make/(x) = 0. 
We assumed in that section that this would work the other way round too, so that if/(l) = 0 
then (x - 1) must be a factor of f(x). We shall now prove that this assumption was justified, 
and we shall also find a very neat way of finding the remainder from doing an algebra long 
division without actually having to do this rather tedious process. 

We prove these useful results as follows: 

Suppose we have some general cubic equation f(x) = ax 3 + bx 2 + cx + d, and we divide 
it by (x - k). (Here, a, b, c, d and k are all standing for whatever particular numbers we might 
have.) We will get 

ax 3 + bx 2 + cx + d R 

= q(x) + (1) 

(x - k) (x - k) 

where q(x) corresponds to the 2x 2 + 3x - 12 of the first example in the last section, and R 
corresponds to the remainder of +16. 

Now we multiply all through by (x - k). This gives us 

ax 3 + bx 2 + cx + d = (x - k)q(x) + R. 

We can compare this with an arithmetical example. 

79 4 

— = 5 + — so 79 = 5 X 15 + 4. 
15 15 

15 goes 5 times into 79 with a remainder of 4. In other words, 79 is made up of 5 lots of 
15 with an extra 4 added on. 
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Here, ax 3 + bx 2 + cx + d is made up of (x - k) lots of q(x) with an extra R added on. 

Since we have f(x) = ax 3 + bx 2 + cx + d = (x - k) q(x) + R, putting x = k gives us 
f(k) = ak 3 + bk 2 + ck + d= (k- k) q (k) + R, that is, f(k) = R. 

From this, if f(k) = 0 then R = 0 also, which means that (x - k) divides into f(x) exactly. 
It is a factor of f(x). 

We now have the following pair of results. 



If we have f(x) = ax 3 + bx 2 + cx + d 

then dividing f(x) by (x - k) gives a remainder of f(k). 

This is the Remainder Theorem for cubics. 

lff(k) = 0, then (x - k) is a factor of f(x). 

This is the Factor Theorem for cubics. 



We now see how we can use these results by looking at the two long division examples 
from the previous section. 

In the first example, we divided f(x) = 2x 3 + 9x 2 - 3x - 20 by (x + 3). To find the 
remainder, we no longer need to do this division. All we have to do is to work out 
/(-3) = 2(-3) 3 + 9(-3) 2 - 3(-3) - 20 = -54 + 81 + 9 - 20 = 16 which agrees with the answer 
that we found there. 

^^^^ Notice the switch in sign from x + 3 to /(- 3). 

^H-^V This is because x + 3 = x - (-3) which corresponds to the x - k. 

If we only need to know the remainder from a long division, we can now find this just by 
working out f(k). 

In the second example, putting x = 3 in f(x) = 2x 3 - 9x 2 + 7x + 6 gives us 
f(3) = 54 - 81+21 + 6 = 0 so therefore (x - 3) must be a factor of/(x). 
Again, we don't need to do the long division to prove this. 

Although we have taken the special case of f(x) being a cubic expression, the argument 
would have worked in exactly the same way for higher whole number powers of x, so these 
two theorems are true for any such expression. 

2.E.(d) Three examples of using these theorems, and a red herring 

example (l) Find the remainder when f(x) = 3x 3 - 4x 2 + 5x - 2 is divided by (x - 2). 

We simply find f{2). This is 3(8) - 4(4) + 5(2) - 2 = 16 so the remainder is 
1 6 and we have not had to do the actual division to find this out. 

example (2) Given that (x - 4) is a factor of f(x) = 6x 3 + ax 2 + bx + 8 and that the 
remainder when f(x) is divided by (x + 1) is - 15, find a and b and the 
other two factors. 

We have 

f(x) = 6x 3 + ax 2 + bx + 8. 
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We are told that (x - 4) is a factor, therefore /(4) = 0. So 

/(4) = 384 + 16a + 4b + 8 = 0 and 4a + b = -98. (1) 

The remainder when /(x) is divided by (x + 1) is - 15. So 
/(-l) = - 15. We have 

/(- 1) = -6 + a - b + 8 = - 15 so a - b = - 17. (2) 

Adding equations (1) and (2) gives 5a = -1 15 so a = -23. 
Substituting in (1) gives -92 + b = -98 so b = -6. 
Check in (2): LHS = -23 - (-6) = -17 = RHS. 

Now we have 

/(x) = 6x 3 - 23x 2 - 6x + 8 = (x - 4)(something). 

Comparing the two sides, the first term in the second bracket must be 
6x 2 . The last term of the second bracket must be -2. Let the middle 
term be px. Then we have 

6x 3 - 23x 2 -6x + 8 = (x - 4)(6x 2 + px - 2). 

Matching the terms in x 2 gives 

-23x 2 = -24x 2 + px 2 so p = \. 

Checking with the term in x we have -6x = -4px - 2x so again we have 
p = 1. So we have 

f(x) = (x - 4)(6x 2 + x - 2) = (x - 4) (2x - l)(3x + 2) 

factorising the second bracket. 

The other two factors are (2x - 1) and (3x + 2). 

example (3) This example is just sufficiently different that you might find it a little 
difficult. 

Suppose you have been asked to show that x 2 - 4 is a factor of 

3x 3 + 4x 2 - 12x - 16. Can you see that you have actually been asked 

about two factors? What are they? 



We can use the difference of two squares to say x 2 - 4 = (x - 2)(x + 2). 
Now,/(2) = 24 + 16 - 24 - 16 = 0 so (x - 2) is a factor. 
f(-2) = - 24 + 16 + 24 - 16 = 0 so (x + 2) is a factor also. 

If two factors are multiplied together, then the resulting expression is 
also a factor. 

example (4) (This is the red herring.) Solve the equation 4x 4 - 37x 2 + 9 = 0. 

It is possible to solve this equation by finding two solutions by 
guessing, but they are quite hard to find, and there is a much neater and 
quicker way of finding the answers. 

This is because what we have been asked to solve is really a heavily 
disguised quadratic equation. 
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If we put y = x 2 , the equation becomes Ay 2 - 37y + 9 = 0. 

Factorising, we get (y - 9) (Ay - 1) = 0 so y = 9 or y = \. (If you 
couldn't spot these factors, you could have used the quadratic equation 
formula to find y.) 

Replacing y by x 2 , we get x 2 = 9 or x 2 = \. 

This gives us the four solutions of x = ±3 or x = ±5. 

exercise 2.E.2 Try these questions for yourself now. 

(1) Show that (x - 2) is a factor of x 3 + 2x 2 - 5X - 6, and find the other two. 

(2) Show that (x - 3) is a factor of 2x 3 - 3X 2 - 8x - 3, and find the other two. 

(3) Factorise completely the expression /(x) = 3X 3 + x 2 - 12X - 4, and hence solve 
the equation f(x) = 0. 

(4) Factorise completely the expression f(x) = 2x 3 + jx 2 + 2x - 3, and hence solve 
the equation /(x) = 0. 

(5) Solve the equation f(x) = x 4 - 29X 2 + 100 = o. 

(6) Given that (x - 3) is a factor of /(x) = 5X 3 + ox 2 + bx - 6, and that the 
remainder when f(x) is divided by (x + 2) is -40, find a and b, and the other 
two factors. 

(7) Show by using long division that (3X - 2) is a factor of 12X 3 + 4X 2 - ijx + 6. 
Show also that this is true by using the Factor Theorem. 

(8) Using long division, find the remainder when 6x 3 + 5X 2 - 8x + 1 is divided by 
(2x - 1). Check that your answer is correct by using the Remainder Theorem. 
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3 Relations and functions 

We now build on the work of the previous two chapters to introduce functions. 
These are very important in scientific and engineering applications, and this chapter 
helps you to understand how they work. 
It is split up into the following sections. 

3.A Two special kinds of relationship 

(a) Direct proportion, (b) Some physical examples of direct proportion, 

(c) More exotic examples, 

(d) Partial direct proportion - lines not through the origin, 

(e) Inverse proportion, (f) Some examples of mixed variation 

3.B An introduction to functions 

(a) What are functions? Some relationships examined, 

(b) y = f(x) - a useful new shorthand, (c) When is a relationship a function? 

(d) Stretching and shifting - new functions from old, 

(e) Two practical examples of shifting and stretching, 

(f) Finding functions of functions, 

(g) Can we go back the other way? Inverse functions, 

(h) Finding inverses of more complicated functions, 

(i) Sketching the particular case of f{x) = (x + 3)/(x - 2), and its inverse, 
(j) Odd and even functions 

3.C Exponential and log functions 

(a) Exponential functions - describing population growth, 

(b) The inverse of a growth function: log functions, 

(c) Finding the logs of some particular numbers, (d) The three laws or rules for logs, 

(e) What are 'e' and 'exp'? A brief introduction, 

(f) Negative exponential functions - describing population decay 

3.D Unveiling secrets - logs and linear forms 

(a) Relationships of the form y = ax", (b) Relationships of the form y = an x , 
(c) What can we do if logs are no help? 



3.A Two special kinds of relationship 

We start this chapter with some more practical examples of the use of equations. Many 
physical laws can be described by the two particular sorts of relation which we shall consider 
next. 



3.A.(a) Direct proportion 

This describes a situation in which two quantities are related together so that as one gets 
bigger the other does also, in the same proportion. If the first quantity is doubled then the 
second quantity will be doubled also. We could take as an example the number of identical 
objects bought and the price paid. 
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The relationship between the number pairs making up the coordinates of the points on the 
straight line shown in Figure 3 .A. 1 also fits this description because it passes through the 
origin. 

Fill in the blanks for the points C, D and E yourself. 




Figure 3.A.1 



You should have C is (6,3), D is (8,4) and E is (12,6). 

Each fraction ylx gives the gradient of the line because all of them give the relative 
change of y with respect to x measured from the origin. We have 



12 3 4 
2 4 6 8 



6 y 

— = — = the gradient, m. 
12 x 



For any two general pairs (x^) and fej^X we have y\lx\ = y 2 lx 2 = i- We know from 
Section 2.B.(f) that the equation of the line through these points is given by y = jx. The j is 
called the constant of proportionality and tells us the relation between this particular set of 
_ys and xs. 



If two quantities x and y vary directly then we can write 

x <* y or x = ky where k is a constant. 
The symbol °c means 'is proportional to'. 



3.A.(b) Some physical examples of direct proportion 

Here are some examples of physical quantities which are related in this way. 

example (l) Charles' Law of gases. This states that the volume, V, of a certain mass 
of gas is directly proportional to its temperature, T, measured from 
absolute zero, which is -273 °C. Therefore we can say 

V =c T or — = — etc. or V = kT. 
T x T 2 

where k is the constant of proportionality. The numerical value of k will 
depend upon the units in which we measure V and T. 
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example (2) The volume, V, of a cylinder of a given cross-section is directly 

proportional to its height, h. (This is shown with two such cylinders in 
Figure 3.A.2.) 




Each ctjlnnde-r has 
the Same radius. 

Figure 3.A.2 

We can say V^h or V 1 /h l =V 2 /h 2 or V = kh. 
Can you see what k will be this time? 



The formula for the volume of a cylinder is V = na 2 h, so k = no 1 . 

example (3) For simple tension or compression (so no bending is involved), stress, a, 
is directly proportional to strain, e. 

We can say a e or o 1 /e 1 = o 2 /e 2 or a = Ee where E is 
the constant of proportionality 

A possible (rather simplified) situation is shown in Figure 3.A.3(a). 




Figure 3.A.3 

Figure 3.A.3(b) shows the cross-section of a typical test specimen with a pre-determined 
gauge length to perform the test on, and large end pieces to enable them to be clamped 
firmly 

The strain is the fractional change in length, and the stress is the stretching force per unit 
cross-sectional area. AL stands for the change in the original length, L. (The symbol 'A' is 
often used to mean 'the change in'.) 
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So we have 

AL F AL 

e = and o = — and therefore FIA = E . 

LA L 

E, the constant of proportionality, is called Young's Modulus of elasticity and is a physical 
property of the particular material concerned. 

Physically, the relationship will only be one of direct proportion, and so represented by 
a straight line through the origin, up to a certain critical point which will depend upon the 
properties of the material concerned. When the strain is increased beyond this critical value, 
deformation takes place and the material behaves differently. The mathematical model of 
direct proportion only works over a limited physical range. 

3.A.(c) More exotic examples 

example (1) The kinetic energy, E, of an object of mass Mmoving at a speed of v is 

given by the relation E = \Mv 2 . (Notice that we have used the symbol E to 
mean different things in this example and the last one. This is because 
engineers and physicists do commonly use this same letter with these two 
different meanings. It is very important in any practical application to 
make sure that you know what the different symbols represent.) 

For two objects moving at the same speed, v, the kinetic energies will 
be directly proportional to the masses of the objects. For example, a 
lorry of mass 6 tonnes moving at a speed of 10 m s _1 has six times the 
kinetic energy of a car of mass 1 tonne, also moving at 10ms -1 . 

But how does the kinetic energy of the car compare when it is 
moving at a speed of 10 m s _1 to when it is moving at a speed of 
30 ms- 1 ? 

The speed is now three times greater but the kinetic energy is 
proportional to the square of the speed. Therefore the kinetic energy is 
nine times greater. 

Here, E = kv 2 with this particular k being j since the mass of the car 
is one tonne. 

example (2) The area of a circle, A, of radius r is given by A = Jtr 2 . 
What is A directly proportional to? 
What is the constant of proportionality? 



A is directly proportional to r 2 , and the constant of proportionality is 71. 



The table below shows possible values for A, r and r 2 . 



A 


0 


71 


4tt 


9ti 


16jt 


25jt 


r 


0 


1 


2 


3 


4 


5 


r 2 


0 


1 


4 


9 


16 


25 



Figure 3.A.4(a) shows a sketch of the graph of .4 against r, and Figure 3.A.4(b) shows a 
sketch of the graph of A against r 2 . 
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From these you will see that plotting A against r gives a graph of the same form as y = x 2 , 
but plotting A against r 2 gives a straight line through the origin of gradient n. 



example (3) The volume, V, of a sphere of radius r is given by V = j Trr 3 
What is F directly proportional to? 
What is the constant of proportionality? 



1 4 

Vis directly proportional to r and the constant of proportionality is 3 jt. 

example (4) In Section 2.A.(d), we used the formula T= lie yftig for the period, T, of a 
simple pendulum of length /. (g stands for the acceleration due to gravity.) 
What is T directly proportional to here? 
What is the constant of proportionality? 



Tis directly proportional to v'/, the square root of the length, so T= kyfT. 

The constant of proportionality is Ittl^g- (This is assuming that the 
acceleration due to gravity can be taken to be constant when we are 
making our measurements.) A graph of T against -fl will give a straight 
line through the origin with gradient Ijiljg. 

exercise 3.A.1 Try answering these questions yourself. Each question is an example of a 

relationship involving direct proportion, and you are asked to compare pairs of 
physical measurements. 

(1) Compare the volumes of the cylinders (a) A and B (b) C and D shown in Figure 
3.A.5. 

(2) Compare the kinetic energy, E lt of a car moving at a speed of 5 ms" 1 with its 
kinetic energy E 2 when it is moving at 30 m s~\ 

(3) Compare the volumes V 1 and V 2 of two spheres if the first sphere has a radius 
of 2 cm and the second has a radius of 8 cm. 

(4) Compare the time of the swing of a simple pendulum of length 9 cm with a 
pendulum of length 25 cm. 
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Figure 3.A.5 



3.A.(d) Partial direct proportion - lines not through the origin 

We have seen that every direct proportion relationship gives us a straight line graph through 
the origin. 

Can we give any physical meaning to pairs of points lying on a straight line which doesn't 
pass through the origin? 

If we take any straight line, so that its equation can be written in the form y = mx + c 
(Section 2.B.(f)), then y is partly directly proportional to x and partly made up of the 
constant, c. 

An electricity bill is a physical example of such a relationship. This is made up partly of 
the cost of the number of units of electricity used and partly of a standing charge which is 
a constant amount added to each bill. (See Figure 3.A.6.) 




Figure 3.A.6 

The equation for a typical electricity bill might read_y = lA2x + 910 where the cost in 
pence per unit used is 7.42 and the standing charge is £9.10. 
y, the total cost, is given in pence by this equation. 

There are many other physical situations which can be described in a similar way. A second 
example is given by the relationship between the volume and the temperature of a gas if we 
don 't measure the temperature on a scale starting from absolute zero. This is because we can 
only have zero volume if the temperature is also at absolute zero, so measurements on a 
temperature scale which starts from here are necessary to make the line pass through the 
origin. 

If the temperature is measured in °C, we shall get a graph like the one shown in Figure 
3.A.7. 
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The equation which relates the volume to the temperature is V = kT + V 0 where k (the 
gradient) = F 0 /273. 

Compare this with the graph of Figure 3. A. 8 which shows the simple relationship of 
direct proportion of volume to absolute temperature, so V = kT. (The absolute temperature 
is measured in degrees Kelvin where 0 K is equivalent to -273 °C.) 




Figure 3.A.8 



In the second graph we have effectively shifted the vertical axis back by 273 °C. We see 
that the mathematical model which correctly describes the physical situation depends upon 
the units we choose to measure in. 

3.A.(e) Inverse proportion 

Two quantities are in inverse proportion if, as one gets larger, the other gets proportionally 
smaller and vice versa. 

For example, if 24 apples are to be shared out equally among different numbers of people, 
we have all the possibilities shown in the table below. 



x (number of apples) 


1 


2 


3 


4 


6 


8 


12 


24 


y (number of people) 


24 


12 


8 


6 


4 


3 


2 


1 



Evidently, in each case xy must be equal to 24. 
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If we plot these pairs of values we no longer get a straight line graph. (The graph we get 
is shown in Figure 3.A.9(a). 



Aut people) 

IS 
u 



o <o 

Figure 3.A.9 



»Z IS 24 



a. — >x 



(apples) 




12. I? 3.4 
( breadth in cm) 



Nor can we reasonably join the points together to form a curve unless we start dividing 
up the apples (or, even more alarmingly, the people). 

However, if we consider instead the possible variation in the measurements of the length 
and breadth of a rectangle of a given area of 24 cm 2 , we get exactly the same pairs of values 
as in the table above but we also get all the intermediate values too, including fractions as 
in the pair j and 48, and irrationals such as ^j24, since ^24 X ^J24 = 24. 

This time, the set of all possible pairs does give a smooth curve and this is shown in 
Figure 3.A.9(b). 

Notice what happens at the two ends of this curve. 

As we make one measurement smaller, so the other measurement has to become 
correspondingly larger to give the fixed area of 24 cm 2 . If the rectangle gets very thin it will 
also have to be extremely long. The points on the curve become closer and closer to the two 
axes but they can never touch since a zero measurement either way gives a zero area. Lines 
like this which a curve approaches but never touches are called asymptotes. 

The relationship here is that / X b = 24 which is a constant. 

A relationship of inverse variation can always be written in this form. 



If two quantities x and y vary inversely, 

then we can write xy = c where c is a constant. 



Another physical example of inverse variation is Boyle's Law for gases which states that, 
for a given mass of gas at a constant temperature, the pressure is inversely proportional to 
the volume, soPF=a constant. 
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3.A.(f) Some examples of mixed variation 

Some physical laws involve a combination of direct and inverse variation. Here are two 
examples. 

(1) For a given mass of gas, Boyle's Law and Charles' Law can be combined into a 
single law which states that PVIT = a constant. 

(2) Newton's Law of gravitation states that F, the force of attraction between two bodies 
of masses mj and m 2 whose distance apart is r, is given by F = k mitn 2 /r 2 . 

This force is directly proportional to the product of the masses, and inversely 
proportional to the square of the distance between the bodies. 

In this first section, we have looked at how some physical relationships can be expressed 
mathematically. If it is possible to describe a physical situation in a mathematical way, it will 
then be possible to obtain reliable and exact information about how the physical variables 
interact with each other. But it is important to realise that the information will only be as 
reliable as the fit of the mathematical model itself to the particular physical situation which 
it is describing. For example, the extension of a spring can be predicted for a known load but, 
if the load is too great, the spring deforms and the new length can no longer be found. 

3.B An introduction to functions 

3.B.(a) What are functions? Some relationships examined 

To be able to describe physical situations mathematically, and so to be able to extract 
detailed information about how they can behave, you need to be confident about handling 
the necessary maths. This next section is about different kinds of mathematical relationship 
and how they work. In particular, we shall look at the special relationships which are called 
functions. 

Suppose we consider the four equations: 

(a) y = 2x + 3, (b) y = x 2 - 2x - 3, 

(c) y = -y— — (d) y = (3x + if' 2 . 
x 2 + 4, 

Each of these gives a relationship between x and y from which we could build up a set of 
ordered pairs or coordinates to draw a graph. 

For each of these four in turn, try answering for yourself the following four questions. 

(1) If you feed different values of x into the relationship, is there just one 
corresponding value of y for each possible value of x? 

(2) Does every new value of x which you feed in give you a correspondingly new value 
of y, or do you sometimes find that two different values of x lead to the same y 
value? 

(3) Do you think that you could reasonably choose any real number as a value of x to 
feed into each of the four cases above? (That is, could you choose any number 
which lies somewhere on the x-axis? Section l.E gives you a description of all the 
different kinds of number which can be found here.) 

(4) Finally, if we make the set of x values as large as possible in each case, what 
happens to the complete set of possible values for yl Is it the same as the set of 
possible values for x? If not, what is it? 
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It will very much help your understanding if you think about these four questions 
carefully yourself and write down what you think is going to happen in each case before you 
go on to look at my answers. 



I will answer the four questions for each example in turn. 

(a) y = 2x + 3 

It is clear that for every value of x which we feed in there is just one possible value ofy, and also 
that each value of y can only come from one possible value of x. Also there is no reason for 
excluding any real number from the possible values of x if we want to make the choice as wide 
as we can. Likewise, y can take all real values. We can see this graphically in Figure 3.B. 1 . 




Figure 3.B.1 



The arrows indicate that the line is infinitely long in either direction. Imagining this 
extension, we see that all possible values of x are included, and also all possible values of 
y. Also, each x value gives only one possible y value, and vice versa. 

(b) y = x 2 - 2x - 3 

This time, for every value of x which we feed in, again there is only one possible value ofy. 

But what about the other way round? For example, if we put x = 4 we get y = 5, and if 
we put x = -2 we also get y = 5. Similarly, both x = 3 and x = -1 give y = 0, so the answer 
to question (2) is 'no' for this relationship. 

The graph sketch looks like Figure 3.B.2. We also see from this that, while there is no 
reason why we shouldn't choose any real number for an x value, the possible values for y 




Figure 3.B.2 
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only go down to the lowest value of the curve. This we can find by completing the square 
like we did in Section 2.D.(b) in the last chapter. 

We have y = x 2 - 2x - 3 = (x - l) 2 - 1 - 3 = (x - l) 2 - 4. 

The least possible value of y is -4 and this happens when x = 1 . 

We see that the range of possible values for y is restricted, because y > -4. 

1 



Again, it is clear here that each value of x fed in gives only one possible value of y. But, like 
last time, we can get the same y value from two different values of x. 

For example, if x = +1 then y = \ and if x = -1 then y = \ also. Notice that every 
symmetrical pair of ± values of x will give the same value for y. 

There is no reason not to allow all possible real numbers as values for x, but think 
carefully about what happens to y\ 

First of all, x 2 + 4 must always be positive, so y is always positive. 

The least value of x 2 + 4 is 4 when x = 0. This gives a corresponding value of y = \ so 
the point (0, 5) lies on this curve. 

Also, y must have its largest value when x 2 + 4 has its least value since y = l/(x 2 + 4). 
As x becomes larger, y becomes correspondingly smaller. (Large positive values of x will 
have the same effect as large negative values since x is being squared.) The graph will be 
symmetrical about the y-axis. You can check this using your calculator if you like; putting 
in a few values such as x = ±1, x = ±2 and x = ±4 also helps with drawing the sketch of 
Figure 3.B.3 below. 




Figure 3.B.3 

We see that the possible values of y lie between 0 and \. 

Also, y can have the value of 4, but it never actually reaches 0 although it gets infinitely 
close to it. We say that the values of y lie in the interval from 0 to \ on its number line, with 
the value \ included, but 0 excluded even though, by taking a sufficiently large value of x, 
we can get as close to 0 as we please. 

We write this interval (0, 5]. The round bracket means that we don't include that end point 
in the set of possible values; and the square bracket means that this end point is included. 

(d) y = (3x+l) m 

Firstly, we see that, unlike the other three, here we can get more than one value of y for just 
one value of x. For example, if x = 5, y = 16 1/2 so y = ±4. (Remember that the 
convention is that yj means 'the positive square root', so if we had written y = yf3x + 1 we 
would have avoided the complication of double-valued ys.) 
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However, it does look as though each possible y value can come from only one x value. 
For example, if y = -5, we have (3x + 1) 1/2 = -5 so 3x + 1 = 25 and x = 8. 

Can we choose any real numbers for our values of x? Not unless we want complications 
coming from trying to take the square root of negative numbers, which is not something 
which we can yet do. 

We must keep 3x + 1 > 0 so 3x > -1 and x > -\. 

The possible y values include all the real numbers, however. 

You can see that this will be so from the example which we took of y = -5. For any 
chosen number, we could repeat this process. 

Figure 3.B.4(a) shows a sketch of the graph of y = (3x + 1) 1/2 . Figure 3.B.4(b) shows 
the graph ofy= yj 3x + 1 . If we always take the positive square root, we just get the top half 
of (a). 




3.B.(b) y = f(x) - a useful new shorthand 

To make explanations simpler, it is often helpful to write what we have so far been calling 
y as f(x), so that we have y =f(x). (We have already used this notation for cubic equations 
in Section 2.E.(a).) 

This means that y can be found from x according to some rule, in the way that the 
different _ys of (a), (b), (c) and (d) above can be found, for example. 
In the case of (a), we would have y =f(x) = 2x + 3, 

so f(2) = 4 + 3 = 7 and f(-3) = -6 + 3 = -3 etc. 

In case (b), 

y=f(x) = x 2 -2x-3, so f(0) = -3 and /(3) =/(-l) = 0 etc. 

This notation is particularly useful when we want to talk about specific values, as we 
have done here. It is also useful for making clear what the variable quantity is. 

An example of this is the case of the ball thrown up in the air, given in Section 2.D.(g). 
There, we used the formula s = ut - jgt 2 to find s, the distance moved from the thrower's 
hands. Both u and g are constants, and t gives the changing measurement of time. Therefore, 
we could write s =f(t) meaning that the distance moved is a function of the time that the 
ball has been in the air. 

A function is a particular form of relationship. Just what makes it particular is the subject 
of the following section. 
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3.B.(c) When is a relationship a function? 

We shall now use the answers which we have just found to the four questions above to lead 
us to some important definitions. 



If a relationship y =f(x) is a function then, for any chosen value of the variable x, 
there is only one corresponding possible value of y. 



Of the four examples from Section 3.B.(a), we found that (a), (b) and (c) are all functions, 
but (d) is not. However, y = ^j3x + I would have been. 

Looking at this requirement graphically, we see that any vertical line on the graph must 
never cut the curve more than once if it is the graph of a function. I call this the raindrop 
test; the raindrop is only allowed to hit the curve once as it slides down the paper. 



A function y =f(x) is called one-to-one if, for each value of y, there is just one 
possible value of x, and for each value of x there is just one possible value of y. 



Example (a) is one-to-one but neither (b) nor (c) are one-to-one since in both cases it is 
possible to have the same value of fix) for different values of x. 



The domain is the set of numbers from which we choose the possible values of x. 



In our four examples we deliberately made this choice as wide as possible, but as we saw 
in case (d), it may be restricted because of the formula involved. There might be 
circumstances in which you would choose to restrict the domain yourself. For example, if 
you were considering a physical problem in which x represented a length, you would require 
the domain to be restricted to positive numbers. 



The set of all possible values of y is called the range. 



We found that in (a) this was the complete set of real numbers (any value for y was 
possible), but in each of (b) and (c) it was restricted in some way. Case (d) is a bit more 
subtle: if y = (3x + 1) 1/2 then, as we can see from Figure 3.B.4(a), y can take any value. 
But, as we also saw there, y = (3x + 1) 1/2 isn't a function. If we force a function by writing 
y = y]3x + I then, as we can see from Figure 3.B.4(b), the possible values of y are 
restricted to y > 0. 

3.B.(d) Stretching and shifting - new functions from old 

What kinds of effect will we get if we create new functions from old ones by adding or 
multiplying the first function in various different ways? We will now look at the results 
obtained from four possible different types of alteration. 
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(1) Adding a fixed amount to a function 

What happens if we go from f(x) to f(x) + a, where a is some given constant number? Here 
are two examples, both taking a = 3. 

(a) f(x) = 2x+l (b) f(x)=x 2 

so /(x) + 3 = 2x + 4. so /(x) + 3=x 2 + 3. 

I show sketches of the two pairs of graphs below in Figure 3.B.5(a) and (b). 




Figure 3.B.5 



We see that the effect of adding 3 to f(x), so that y = fix) + 3, is to shift the graph up by 
3 units. 

(2) Adding a fixed amount to each x value 

What will happen if we add a fixed amount to each x value instead, so that we go from fix) 
to fix + a) in each case? Again, we look at two examples, taking a = 3. 

(a) /(x) = 2x+l (b) /(x) = x 2 

so fix + 3) = 2(x + 3) + 1 = 2x + 7. so f(x + 3) = (x + 3) 2 . 

Notice that, to find fix + 3) from fix), we just replace x by (x + 3). 

I show sketches of the two pairs of graphs in Figure 3.B.6(a) and (b). 

This time, the effect is to slide the whole graph 3 units to the left. Notice that the 
interesting bits happen 3 units sooner. For example, each contact with the x-axis happens 3 
units earlier now. 

^^^^ What actually happens here is not what you might think at first; notice that 
^B-^V fix + 3) is what you get if you slide /(x) three units to the left, not to the right. 

Because the function of (a) is a straight line, we can get the same effect as this sideways 
shift by giving the line an upwards shift of 6 units, so making fix) go to fix) + 6 with our 
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Figure 3.B.6 



particular f(x) of 2x + 1 . The only way we could tell which of these transformations had been 
done would be to keep track of what happened to particular points. For example, in the first 
case, the point (0, 1) goes to (-3, 1), as we can see on Figure 3.B.6(a). In the second case, 
(0, 1) would go to (0, 7). 

We could also get the same end result for the line by moving it both sideways and upwards. 
Once we allow two shifts, the number of different possibilities becomes infinite. 

(3) Multiplying the original function by a fixed amount 

What will happen if we go from f(x) to a f(x) where a is some given constant number? 
Working with the same two examples as before, and with a = 3 again, we get 

(a) | /(x) = 2x+l (b) | f(x)=x 2 

I so 3/(x) = 6x + 3. [so 3/(x) = 3x 2 

Sketches of the two pairs of graphs are shown below in Figure 3.B.7(a) and (b). 




Figure 3.B.7 



This time, the whole graph has been pulled away from the x-axis by a factor of 3, so that 
every point is now three times further away than it was originally. Therefore the only points 
on the graph which will remain unchanged are those on the x-axis itself. 
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(4) Multiplying x by a fixed amount 

What will happen if we go from f{x) to /(ax)? 

Taking our same two examples, with a = 3, we have 

(a) | /(x) = 2x+l (b) | f(x)=x 2 

I so f(3x) = 2(3x) + 1 = 6x + 1 [so /(3x) = (3x) 2 = 9x 2 . 

Notice that we simply replace x by 3x to find /(3x) from /(x). 

I show sketches of the two pairs of graphs below in Figure 3.B.8(a) and (b). 




Figure 3.B.8 



This time the stretching effect is more complicated because it only affects the part of the 
function involving x. Any purely number parts remain unchanged. The points which are 
unaffected by the stretching are those where the graphs cut the _y-axis, so x = 0. 

Notice too that the strength of the effect now depends upon the power of x. Having (3x) 2 
in example 4(b) gives a more extreme effect than the 3x 2 in 3(b), since the 3 is also being 
squared here. 

We can relate examples 3(a) and 4 (a) to the real-life situation of the electricity bill graph 
shown earlier in Section 3.A.(d). The positive parts of the two graphs of 3(a) correspond to 
a situation of increasing both the standing charge and the cost per unit by a factor of three, 
while the positive parts of the two graphs of 4(a) could show an increase in the cost per unit 
of three, but an unchanged standing charge. (In this physical application, negative values of 
x or j would be meaningless.) 

It has been easier in all these descriptions to stick to the same variable, x, for the 
functions. However, there is no reason why another letter should not be used. 

In the physical example in Section 2.D.(g), on the motion of a ball when it is thrown up 
in the air, we described the distance travelled in terms of t, the time from when it left the 
thrower's hands. 

We used the function s =f(t) = ut-j & , and the horizontal axis was a f-axis instead of 
an x-axis. 
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We have now looked at the four simplest kinds of transformation of functions, and their 
graphical effects. I will list these for you below. 



A summary of some effects of transforming functions 

(1) Transforming/^*) to f(x) + a shifts the whole of f(x) upwards by a distance a. 
We have 



(4) 




Figure 3.B.9 (a) 



(2) Transforming/^*) into f(x + a) shifts the whole of f(x) back a distance a, 

because the curve is getting to each of its values faster, by an amount a. We have 




Figure 3.B.9 (b) 

Shifts are sometimes called translations. 

(3) Transforming/(x) into af(x) stretches out each value of f(x) by a factor a. We have 




Figure 3.B.9 (c) 



Transforming/(x) into /(ax) has a more complicated effect, since how much a 
affects each part of f(x) depends on what is happening to x itself in f(x). For 
example, if f(x) = x 2 + x + 1 , then f(ax) = a 2 x 2 + ax + 1 . Each term has been 
affected differently. Therefore it is not possible to show this case on one sketch; 
the change in shape will depend entirely upon the function concerned. 
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The following exercise gives you a chance to practise recognising these shifts and 
stretches for yourself. Although / is the letter most commonly used for functions, it is 
sometimes more convenient to use other letters to avoid confusion. I do this here, having 
functions called g(x), h(x) etc. 

exercise 3.B.1 This exercise contains four questions, each of which involves one of the following 

four functions. 

(1) fix) = 3X - 1 (2) g(x) = 2x-2 (3) h{x) = \ x + 1 (4) p(x) = x 2 - qx + 3. 

Each question shows the original function on the left, followed by two 
examples of stretching or shifting it beside it. (See Figure 3.B.10 below.) 

You have to decide what particular stretch or shift has happened in each case, and 
then write it in beside its graph. (For example, in Figure 3.6.5(a) earlier, I showed the 
shift of f{x) to f{x) + 3.) Then check in the answers given at the back of the book to 
see if you have decided correctly. (Don't be tempted to go straight there!) 

To make the questions easier for you, the constant number involved in each 
transformation (its 'a') is always either +2 or -2. This also means that you will be 
able to tell whether I have shifted my straight lines up or down or sideways to get 
them to their new positions. 




Figure 3.B.10 
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3.B.(e) Two practical examples of shifting and stretching 
The method of completing the square 

When we do the process of completing the square for a quadratic expression, as we did in 
Section 2.D.(b), we are actually finding what shift we would need to do to make the curve 
sit on the x-axis. 

For example, if we take the curve y = x 2 - 4x + 9, we can use the method of completing 
the square to rewrite this as y = (x - 2) 2 - 4 + 9 = (x - 2) 2 + 5. 

The curve y = (x - 2) 2 , which I have drawn in Figure 3.B.1 1(a), just touches the x-axis 
when x = 2. 

The curve y = (x - 2) 2 + 5 is the result of shifting the curve y = (x - 2) 2 up by 5 units. 
I have drawn this in Figure 3. B. 11(b). 

We can see from this picture that y = (x - 2) 2 + 5 = x 2 - 4x + 9 has a minimum value 
of 5 when x = 2. 




Figure 3.B.11 



How we get the standard Normal distribution 

If you have used Normal probability distributions in statistics, you will already have met an 
application of stretching and shifting. Briefly, the situation here is that we can model the 
likelihood of certain types of measurements occurring within particular intervals by 
considering the area under a curve called a Normal distribution curve which I sketch below 
in Figure 3. B. 12(a). 

Two examples of the kinds of measurement which can have their likelihoods modelled by 
this kind of graph are the heights of all adult males, and the errors made in measuring a 
particular length as accurately as possible. In both cases, a large number of measurements 
will be bunched symmetrically about the mean and the more extreme examples will tail off 
fairly steeply either side. 




Figure 3.B.12 
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On the graph sketch, fi represents the mean or average measurement, and a represents a 
measure of how spread out these measurements are. The curve flexes itself at a distance o 
away from fi either side. 

The area under the curve gives the probabilities of measurements lying between certain 
values. For example, the likelihood of a randomly chosen x lying between x 1 and x 2 is given 
by the shaded area shown in Figure 3. B. 12(b). 

These areas are extremely difficult to calculate since the equation of the curve is 
mathematically complicated but since they are very frequently needed, tables have been 
calculated from which the different probabilities can be read off. 

There is only one problem: it would be impossible to print the tables for every Normal 
distribution curve, and the tables just give the results for the simplest possible case, which 
I show in Figure 3. B. 13(a). For this curve, fi = 0 and o = 1. The variable along the horizontal 
axis is called the standard Normal variable. This is always given the letter z. 

Beside the standard Normal distribution curve, I show again the general Normal 
distribution curve in Figure 3. B. 13(b). 




Figure 3.B.13 



How can we get from the curve shown in (b) to the curve shown in (a)? 



In order to transform (b) into (a) we have to shift the j-axis forwards by ft, so this would 
make z = x - fi. 

But this alone is not sufficient because, in (a), we have also squeezed the x measurements 
by a factor of 1/cr. So to get from (b) to (a), we put 

x - fi 

z = . 

o 

This is the formula for finding the standard Normal variable, z, which corresponds to a 
value x in a Normal distribution curve like Figure 3.B. 13(b) above with mean fi and standard 
deviation a. 

To sketch the correct graph, the y measurements have to be stretched by a factor of o 
since the total area under the graph remains one unit. (This is because it gives the sum of 
all the possible likelihoods or probabilities of the measurements concerned.) 

The equation of each Normal distribution curve is in terms of its particular fi and o, and 
this stretching of the y measurement takes place automatically in the new curve because of 
the property of unit area. 

Instead of having to find the area between x 1 andx 2 shown in Figure 3. B. 12(b) above, we 
can now use the tables to find the area between the corresponding zj and z 2 of the standard 
Normal curve. The tables give the two cumulative areas measured from the left-hand end of 
the curve up to z 1 and z 2 respectively, and the required area is the difference between these 
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two. Since the total area remains 1, this area is unchanged in the two graphs. It is just a 
different shape. 

There is one other rather neat spin-off from this transformation. Because the standard 
Normal curve is symmetrically placed about the origin, the tables only have to give values 
for one side. In practice, this is the right-hand side, and values for the left-hand side are 
found by using the symmetry of the curve. 

3.B.(f) Finding functions of functions 

In Section 3.B.(d), we were able to see graphically the effects that some simple changes have 
on functions. But suppose the changes are more complicated because they have been built 
up from a number of simple steps. It's not so easy then to work out what is happening 
geometrically, but it is easy to find out what has happened using algebra. We can think of 
these changes as involving functions of functions. 

Suppose we start with the two functions f(x) = 2x + 3 and g(x) = 5x. 

What kind of meaning can we give to the expressions f(g{x)) and g(f(x))l 

Do they mean the same thing? 

This is a topic which sometimes makes students nervous, so we will look at it in some 
detail. 

The instruction which f(x) gives us is to 'double and add three', so we will have 
/(lump) = 2 (lump) + 3, whatever the 'lump' may be. 

Similarly, g(lump) = 5(lump), whatever that lump may be. 

Therefore f(g(x)) =f(5x) = 2(5x) + 3 = lOx + 3 and g(f{x)) = g(2x + 3) = 5(2x + 3) = 
lOx + 15. 

The two results are different, and in general /(g(x)) will not be the same as g(f(x)). In 
fact, in this example, f(g(x)) is never equal to g(f(x)) for any value of x since we can't find 
an x so that lOx + 3 = lOx + 15. 

Notice the order of the operations. The inside function acts on x first, and then the outside 
function acts on the result. 

exercise 3-B.2 Try these for yourself. 

Find (a) /foM) (b) 5 (flx)) if 

(1) f(x) = 3x - 5 and g(x) = 2x 

(2) f{x) = x 2 and g(x) = 4 - x 

(3) /W = x and 900 = x ~ 4- 

Similarly, f(f(x)), which is the function of the function itself, holds no terrors. 
We'll look at two examples to prove that this is so. 

example (1) /(*) = 2x + 3 so f(f(x)) = 2(f(x)) + 3 = 2(2x + 3) + 3 = 4x + 9. 

We can check that this works by putting x = 2, say. Then we can find 
f(f{2)) either by doing / twice, getting /(2) = 7 and /(7) = 17, or in one 
step using f(f(x)) = 4x + 9 so f(f (2)) = 8 + 9=17. 
Try doing one for yourself before we go on. 
If g(x) = 2x 2 + 3 what is g(g(x))? Check with x = 1. 
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g(g(x)) = g(2x 2 + 3) = 2(2x 2 + 3) 2 + 3 

= 2(4x 4 + 12x 2 + 9) + 3 = 8x 4 + 24x 2 + 21. 



Check: g(l) = 5 and g(5) = 50 + 3 = 53. 
Alternatively, g(g(l)) = 8 + 24 + 21 = 53. 

2x + 3 

example (2) Now we'll find f(f(x)) if /(x) = . 

3x + 2 

To find f(f(x)) we simply replace the x of the formula by f(x), so we get 

2(rri) + 3 



3(gJ!) + 2' 



We then simplify this unwieldy fraction by multiplying top and bottom 
by (3x + 2). (Remember that this leaves the value of the fraction 
unchanged - see Section l.C.(a) if necessary.) So we have 

2(2x + 3) + 3(3x + 2) 13x + 12 

/(/"(*)) = = ■ 

3(2x + 3) + 2(3x + 2) 12x + 13 

We must exclude the one value of x for which the function is undefined 

13 

by saying x ^ - 75 • This value would make 12x + 13 = 0, and so involve 
us in trying to divide by zero which is impossible. (This is also in 
Section l.C.(a).) 

Try this very similar example for yourself, because it is also good practice for tidying up 
fractions within fractions, sort of double-decker fractions. See if you can get right through 
without referring back to the example above. (You could have another good look at that one 
first.) 

If/(x) = ^^ find (a)/(3),(b)/(x 2 ),(c)/(2x+l) and (dW(x)). 
4x + 1 



Here are the answers. 

First of all, you wouldn't even consider cancelling the 2 and the 4 in the definition of f(x). 
If you would, you should return to Sections l.C.(a) and (b) and go through them again! 
You should have: 

2(3) - 5 1 

(a) /(3) = 



(b) /(x 2 ) = 



4(3) + 1 13 
2(x 2 ) - 5 2x 2 



(c) /(2x+l) = 



4(x 2 ) +1 4x 2 + 1 

2(2x + 1) - 5 4x - 3 



4(2x + 1) + 1 8x + 5 
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2(^-7 

(d) /(/(.v)) ' 



4(*Lz±) + i 



2(2x - 5) - 5 (4x + 1) 

(multiplying top and bottom by (4x + 1)) 

4(2x - 5) + (4x + 1) 

-16* - 15 



12x - 19 
16x + 15 



(multiplying top and bottom by -1 to make the answer 



19 l2x look more tasteful). 



3.B.(g) Can we go back the other way? Inverse functions 

We have now worked with quite a large number of functions each of which gives us a rule 
for finding the function from any given starting value of x. We also know that, in order for 
this relationship to be a function, the rule must give just one possible answer for each 
starting value of x. 

Is it possible to go back the other way? If we know a value of f(x) for a particular function 
can we work out from this what the original value of x must have been? 
Can you see any difficulty which we might have? 



We can only do the backwards process if each value off(x) comes from just one possible 
x. This is why the answer to the second question of Section 3.B.(a) was so important. For 
example, in the case of function (b) which was_y =f(x) = x 2 - 2x - 3, we have /(4)=/(-2)=5. 
Therefore, from knowing that f(x) = 5, it is not possible to say what value of x gave this, 
since it could be either 4 or -2. Since the backwards relation has more than one possible 
answer, it is not a function. 



The function (if it exists) which undoes the effect of f(x) and brings you back to 
where you started is called the inverse function of x. It is written f~ l (x). 

• A function can only have an inverse function if it is one-to-one. This means that 
f(a) =f(b) only if a = b. 

• Iff- 1 (x) exists, then/- 1 (f(x)) =f{f~\x)) = x. 

• Each of / and f~ l undoes the effect of the other. 



^^^^ / l (x) does not mean l//(x). 

^K^F You can, if you want, write l/f(x) as (/"(x)) -1 . It is just unfortunate that the 
mathematical way of writing these two very different things looks so similar. 
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For simple functions, it is often very easy to see what the inverse function must be. Here 
are two examples. 

(1) If/(x) = x + 3, then/- 1 (x) = x - 3 so, for example, f(4) = 7 and/" 1 (7) = 4. 

(2) If g(x) = 5x then g~ l (x) = \x so g(2) = 10 and g~ ! (10) = 2. 

Graphically, these two examples correspond to shifting x up and then shifting back down 
by 3 units in the case of (1), and stretching x and then shrinking it back by a factor of 5 in 
the case of (2). (These graphical effects were looked at in Section 3.B.(d).) 

To make clearer what is happening here, it can sometimes be helpful to use an alternative 
way of writing functions which emphasises the carrying across or mapping of x into the 
function f(x). 

Taking/(x) = x + 3 as an example, we can also write this as/: x *-> x + 3 which means the 
function /in which x maps to x + 3. Then we write the inverse function as/ -1 : x >-» x - 3. 
Similarly, if g:x >-> 5x, then g -1 : x >-> \x. 

Try finding the inverse functions of the following three functions yourself. 
(l)/(x)=x-2 (2)g(x) = 2x (3)p(x) = 6-x 



You should have (1) / _1 (x) = x + 2 and (2) g _1 (x) = \x. 

Students often find (3) a little bit tricky. Clearly, it isn 't true that p' 1 (x) = 6 + x since this 
doesn't bring us back to where we started. 

If you haven't been able to find an answer, try finding p{\), p(5), p(2) and p(A). 

You will see that doing p{x) twice brings you back to the original x, so that p{x) is its own 
inverse function. We can say that p(p(x)) = x. 

A function which is its own inverse is called self-inverse. 



If /(x) is self-inverse, then / 1 (x) = f(x) so f(f(x)) = x. 



(4) Can you find the inverse function for q(x) = 12/x? 



Trying the pairs of values for x of 12 and 1, 6 and 2, and 3 and 4, shows us that this 
function is also self-inverse. These pairs of values are behaving symmetrically with respect 
to each other. 

This is the same kind of relationship as those that we looked at in Section 3.A.(e) on 
inverse proportion. However, unlike the physical examples of inverse proportion which 
we looked at there, this function also includes negative pairs such as -3 and -4, and -2 
and -6. 

I show in Figure 3.B.14 graph sketches for the pairs of functions and their inverses from 
the four questions above, taking equal scales on the x and y axes. 

This is a good place to add colour to the sketches yourself. If you use two colours 
so that you can highlight each function and its inverse function differently, you will bring 
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Figure 3.B.14 

out two important points. The first is that the two self-inverse functions are the same 
function; they lie on top of one another. The second is that all the four pairs of graphs 
shown have the same line of symmetry. Try sketching in this line yourself on each of the 
four graphs. 



Each function and its inverse function are symmetrically placed about the diagonal line 
y = x. This symmetry stresses the equal standing of each function with its inverse; each is 
the inverse of the other. They are mirror images of each other in the line y = x because the 
original function is taking x to y, and the inverse function takes y back to x. This symmetry 
means that the domain, the set of all possible x values for the original function, is the same 
as the range, the set of all possible y values for the inverse function, and the range of the 
original function gives the domain of the inverse function. 

For the two self-inverse functions, the original function is itself symmetrical about the 
line y = x. Each half of the line or curve reflects onto the other half, and therefore we can 
see geometrically that these functions must be their own inverses. 

Notice that this symmetry means that it is always possible to sketch an inverse function 
if we know what the original function looks like. This sketching is easier if equal scales are 
chosen on the two axes, so that the line y = x is at 45°. A quick sketch is much the easiest 
way of seeing how an inverse function works. 
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3.B.(h) Finding inverses of more complicated functions 

How can we find the inverse function if the starting function is more complicated? For 
example, what is f' 1 for f(x) = 2x - 5 or/: x >-» 2x - 5? 

It's not very easy to write down the answer immediately (Try it and see, checking with 
some numbers to see if your answer works.) However, we can work out what it must be in 
the following way 

We have y =f(x) = 2x - 5. This gives the rule or formula for finding y if we know x. 

We are looking for the rule which, if we know y, will take us back to the original x. We 
can find this by rearranging y = 2x - 5 to change it to the form x = some rule involving y. 
This is called changing the subject of the formula to x, and we have already done this for 
some physical formulas in Section 2.A.(d). 

We have y = 2x - 5 so y + 5 = 2x so x = \ (y + 5), so giving us the rule which will 
take us back from y to the original x. 

We can check that it works by doing a numerical test. If x = 3 then y = 6 - 5 = 1 and if 
y = 1 thenx = \ (1 + 5) = 3. 

We now use the rule we have found to write the inverse function so that it is itself a 
function of x. Using the mirror-image property of the function and its inverse about y = x, we 
simply swap x and y getting f' 1 (x) = 5 (x + 5). The line giving/ -1 (x) is y = j (x + 5). 

I show both /(x) and / _1 (x) in Figure 3.B.15. 




I have also shown 3 >-> 1 using /(x), and 1 >-> 3 using/ 1 (x). 
Can you work out where the two functions cross over each other? 
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They cross over where f(x) =f 1 (x) so 2x-5 = 5 (x+5) giving Ax -10 = x+5 so x = 5. 
Check: f{5) = 10 - 5 = 5 and/- 1 (5) = \ (5 + 5) = 5. 

The crossing point is at (5, 5) on the line y = x which checks with what we know must 
be true geometrically. 

We set about finding the inverse function for a function involving a fraction like 
f(x) = (x+3)/(x-2) in exactly the same kind of way We have 

x + 3 x + 3 

/(*) = or /:ih - 

x - 2 x - 2 

meaning that, under the function/ x maps to (x+3)/(x-2), so, for example, 3 maps to 6. Let 

x + 3 



where y gives the outcome of feeding x into the function, as 6 is the outcome of feeding 3 
into the function. 

As before, we are looking for a formula which, if we know y, will take us back to the 
original x, so we change the subject of the formula to x. 

x + 3 

y = so y(x - 2) = x + 3 so xy - 2y = x + 3. 

x - 2 

Now we collect all the terms with x in on the same side of the equation, because then we will 
be able to factorise. We have 

2y + 3 

xy - x = 2y + 3 so x(y - 1) = 2y + 3 so x = . 

y- 1 

We've now got the rule which, if we know y, will give us the original x. 

Just as we did in the last example, we can now use this rule, and the mirror-image 
property of the function and its inverse in the line y = x, to get the inverse function by 
swapping y and x. This gives us 

2x + 3 _ 2x + 3 

/ 1 (*) = or -. 

x — 1 x — 1 

Check: if we feed in x = 6 we have f~ { (6) =15/5 = 3. 

2x + 3 

To draw the graph of this inverse function, we would draw y = 

x - 1 

We shall look together at how we can sketch / and f~ l in the next section, but before that I'll 
give you a chance to find a few inverse functions for yourself. 

exercise 3.B.3 Find the inverse functions for each of the following functions. (Some of them you 

will be able to write down straight away and some of them will need rearranging 
like the last two examples.) 

(1) fix) = sx (2) f{x) = x - 9 (3) f(x) = 5X - 9 

(4) f(x) = 8 - x (5) fix) = x/n (6) fix) = nlx (7) / = 3 - 2x 

x - 3 2X + 3 
(8) /DO (x * -2) (9) fix) (x * 2.) 

X + 2 X - 2 

We say x * -2 in (8) and x * 2 in (9) to make it clear that we don't think that we 
can divide by zero. 
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3.B.(i) Sketching the particular case of f{x) = (x + 3)/(x - 2), and its inverse 

We will now look into how we can set about drawing graph sketches for 

x + 3 2x + 3 

/(*) = " and /" 1 (x) = -. 

x - 2 x - 1 

Each of these functions is more complicated than any that we have sketched so far, but they 
have interesting properties that it will be useful for you to see here. Also, if we can draw a 
sketch for f(x) we shall then be able to reflect this in the line of symmetry y = x to draw the 
sketch of f~ { (x). 

In order to sketch y =f(x) we need to find out what it does at all its interesting bits. We 
do this rather than making a table of values because we might choose the x values badly, so 
that what we sketched was just a boring bit, such as a piece of curve which is almost a 
straight line. (Many students panic at this stage, and make it into a completely straight line, 
so finishing up with a total disaster.) 

To investigate the interesting bits, we need to answer the following questions. 

(a) When does f(x) = 0? 

(b) What is the value of f(x) when x = 0? 

(c) Is there any value of x which we can't have because f{x) would be undefined for 
this value? If so, what happens to f(x) when x gets near this forbidden value? 

(d) What happens to f{x) when x becomes very large? 

Test your theory with some large positive and negative values of x. 

Try answering each of these four questions yourself for the function f{x) above which we 
want to sketch. 



x + 3 

(a) f(x) = 0 if - = 0. 

x - 2 

This happens if x = -3. (Notice that we only have to look at the top of the fraction to answer 
this question. However many parts something is divided into, if you get none of those parts 
you've got nothing.) 

We now know that f(x) cuts the x-axis at (-3, 0). 

3 3 

(b) f{x) = when x = 0 so f(x) cuts the j-axis at (0, -5). 

(c) We can't have x = 2 because we can't divide by zero. 

If x is very close to 2, say 1 .999 or 2.00 1 , then (x - 2) is very small, and dividing 
by a very small number gives a very large result. 

Just before x = 2,f(x) is very large and negative, and just after x = 2,/(x) is very 
large and positive. (You can check this on your calculator if you wish.) 

f(x) becomes closer and closer here to the line x = 2. (This line is called a 
vertical asymptote.) 

x + 3 

(d) What happens to y = f(x) = as x becomes very large? 

x - 2 

The easiest way of seeing what must happen here is to divide the top and bottom of f{x) by 
x. This gives us 

x + 3 1 + (3/x) 

fix) = = — . 

x - 2 1 - (2/x) 
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Now, as x becomes very large, (either positive or negative), both (3/x) and (2/jc) will become 
extremely small. The larger x becomes, the tinier they get, and indeed we can make them as 
small as we please by choosing a large enough value of x. (We can't actually make them 
equal to zero because this would require x to be infinitely large and, as we saw with the two 
straight lines in Section l.E.(d), infinite quantities of things behave in strange ways.) 

We see that, as x becomes very large, /(x) will become closer and closer to 1/1 = 1. 

This means that we know that the curve of y = f(x) becomes closer and closer to the 
straight line y = 1 as the values of x become larger and larger. (This line is called a 
horizontal asymptote.) 

We now have enough information to be able to have a good try at sketching this curve. 
First, we draw the two axes and mark on them where the curve crosses them using our 
answers to (a) and (b). Then we draw in the two lines y = 1 and x = 2 which we know the 
curve gets closer and closer to. We then sketch in the curve which seems to fit in best with 
this information. I've done this in Figure 3.B.16. 




Figure 3.B.16 



The only question we can't yet answer is how the slope of the curve changes from point 
to point. Could it perhaps have some kinks and wiggles that we don't know about? Finding 
out how slopes change is the subject of Chapter 8, and in Section 8.E.(c) I shall give you a 
full list of curve-sketching help which will include this. Also, in Section 8.C.(e), we shall 
show that this particular curve must always have a negative slope (except when x = 2). 

For this particular curve, it is also possible to show that its slope is always downhill by 
taking any two points which lie on it which are both either to the left of x = 2 or to the right 
of it. If you then work out the gradient of the straight line joining them, you will find that 
it is always negative. 

This curve is interesting because of another special property. It's only the second one 
we've met which does this particular thing. Can you see what it is? 
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It does a jump. This jump, which happens whenx = 2, is called a discontinuity. Because 
of it, this curve can't be drawn with a continuous pencil line. (The other one like it is 
example (4) at the end of Section 3.B.(g) - in fact, it is very like it indeed. When we've 
finished this graph sketch, I shall show you how to turn this one into that one.) 

Using the fact that the graph of f~ { (x) is the same as the graph of f(x) reflected in the 
line of symmetry y = x, we can now sketch both of these graphs together. 



If you are sketching an inverse function by this method, the best method for 
drawing it convincingly is to turn your paper so that the line y = x is vertical. 
This makes it much easier to get / and f~ { symmetrically placed either side of 
this line. 



I show my two graphs in Figure 3.B.17. 

The two asymptotes of y = f(x) will also be reflected in the line y = x to give the 
corresponding pair of asymptotes of_y =f~ 1 (x). 

Adding your own colours to / and f' 1 and the two pairs of asymptotes x = 2 and y = 1, 
and x = 1 and y = 2 would help you to see exactly what is going on. 




Trie asymptotes /or cur*. C*e UntS X=Z ar»d 



TWe asymptotes for fX*) are IK* lines Jt=l and 2. 
Figure 3.B.17 
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From this graph sketch, you can see the symmetry of the gaps in the domain and range 
of f(x) and / _1 (x) respectively The value 2 is excluded from the domain, the set of possible 
x values for f{x), and also from the range, the set of possible y values for / _1 (x), and the value 
1 is excluded from the range of f(x) and the domain of / _1 (x). 

exercise 3.B.4 Using similar methods to those we used together above, find out as much 

information as you can about the following two functions. 

x - 2 2X - 5 

(1) g(x) (2) h{x) 

x + 4 X + 1 

Use this information to sketch the graphs of the two functions. (Of course, for 
all of this sketching you could just use a graph-sketching calculator - but if you 
answer the questions for each curve like we did in the example, you'll know why it 
does what it does.) 

Find also the two inverse functions, g _1 (x) and 

(3) Sketch the function 
2X + 3 

m 

X - 2 

from question (9) of Exercise 3.B.3 and draw in the line y = x on your sketch. 

Now we find out how to turn y = (x+3)/(x-2) into y = 12/x which was (4) at the end of 
Section 3.B.(g). 

Looking at the sketch ofj = (x+3)/(x-2) in Figure 3.B.16, we can see that, if we move 
the x-axis up by one unit and the j-axis to the right by two units, we shall have transformed 
this sketch into one very similar to the sketch for (4). 

We could think of this as putting Y = y - 1 and X = x - 2. 

We can see this nicely by using algebra. We have 

x+3 x-2+5 5 

y -f(x) = = — = 1 + 

x-2 x-2 x-2 

5 



so y - 1 = 

x - z 

Putting 7 = >>-landX = x- 2 gives Y = 5IX. 

I show its graph sketch below in Figure 3.B.18, with the graph sketch of y = Mix. 




The only difference now is one of scale. If we shrink (b) by a factor of 5/12, we get the 
identical graph to (a). 
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3.B.Q) 



Odd and even functions 

Make sketches for yourself of the graphs of the following four functions. 

(a) y = x (b) y = x 2 (c) y = x 3 (d) y = \x\. 



\x\ means 'take the positive value whatever the sign of x itself. 
What kinds of symmetry do you see in your sketches? Describe them. 



Your four graphs should show two different sorts of symmetry, so giving you examples 
of what are called even and odd functions. 



Even functions 

A function is even if it is symmetrical about the j-axis. 
For these functions, f(x) = f(-x) for any value of x. 



The functions (b) and (d) above are both examples of this. The standard Normal 
distribution, which we talked about in Section 3.B.(e), is also an even function, and it is this 
property which makes it possible to halve the size of the tables needed to work with it. 

The sketches for (a) and (c) show a different sort of symmetry. In each case, if we rotate 
the graph through a half turn about the origin, then it exactly fits onto itself. Put another way, 
turning the page upside down leaves the graph unchanged. 



Odd functions 

A function is odd if rotation through a half-turn leaves it unchanged. 
This is the same as saying that the function reverses its sign if it is 
reflected in the j-axis, so f(x) = -f(-x). 



Figure 3.B.19 shows my sketches of the four graphs for (a), (b), (c) and (d). 




® 3= lx.1 



Figure 3.B.19 



See if you can decide which of (a), (b), (c) and (d) have inverse functions. 
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(a) and (c) will each have an inverse function because each value of y is given by only 
one possible value of x, but (b) and (d) will only have inverse relations. 
With (b) for example, ify = A then x could be +2 or -2. 

If y = x 1 then x = y 1/2 . The inverse relation is x >-» x 1/2 , and x 1/2 can be either + or -. 
The sketch in Figure 3.B.20(a) shows the graphs of y = x 2 and its inverse relation 




Figure 3.B.20 



However, if we say that x cannot be negative, so that we restrict the domain of y = x 2 to 
values of x which are greater than or equal to 0 (which we write as x > 0), then we shall have 
a perfectly good inverse function which is y = yfx. This is shown in Figure 3.B.20(b). The 
symbol -f~ is taken to mean the positive square root only. 

3.C Exponential and log functions 

3.C.(a) Exponential functions - describing population growth 

The functions which we shall look at in this next section are of huge importance to scientists 
and engineers. This is because they describe many physical situations where there is a 
smooth rate of growth which depends on how much of the substance is present at any 
particular time. An example of this is the process by which cell growth takes place through 
the repeated division of individual cells into two new cells. 

To help us to see what is going on in this kind of situation, we'll look at what happens if we 
have a population of cells which doubles in size every hour. We'll suppose that there are 1000 
cells at the time when we start measuring. Then after 1 hour we would have 2000 cells, after 2 
hours we would have 4000 cells, and so on. (We will assume that the growth process is taking 
place as smoothly as possible, so that particular groups of cells don't all double at the same 
instant, and that conditions remain favourable for this continued growth. When the nutrients 
start to run out, this mathematical description of what is happening will break down.) 

We could make the table shown below to show the number of cells present at particular 
instants in time, measured from a starting value of t = 0 when there are one thousand cells. 
(I am using the letter t to stand for time as this is the usual choice.) Then x, the number of 
thousands of cells present, is a function of t. 



t (time in hours) 


-2 


-1 


0 


i 

2 


1 


2 


3 


4 


x (number of cells in thousands) 






1 




2 


4 
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I have left some gaps in the table. Try filling in these for yourself, in the following 
order: 

(a) the numbers of thousands of cells which will be present after 3 hours and after 4 
hours, 

(b) the number of thousands of cells present both 1 hour and 2 hours before the 
measuring started, 

(c) the number of thousands of cells present after half an hour. 



(a) For this, you should have 8000 after 3 hours and 16 000 after 4 hours, giving x = 8 
and x = 16. The rule that gives you these answers is x = 2*. 

(b) For this, you should have x = \ when t = -1, meaning that there were 500 cells 
present 1 hour before measuring started and x = \ when t = -2, meaning there were 
250 cells present 2 hours before the measuring started. These numbers fit in with 
the meanings which we gave to negative powers in Section l.D.(b). 

(c) From Section l.D.(b), too, we take 2 1/2 as meaning ^2 so that there will be about 
1414 cells after half an hour. You should go through this section now if you are 
unsure about these last results. 

I show in Figure 3.C.1 a sketch of what happens if we plot the first seven of these pairs 
of values. 

A. x (number of cells present \v\ thousands) 




>t (time \'n hours) 



Figure 3.C.1 

They appear to form part of a smooth curve, so it would seem reasonable to join them 
up in this way since it shows very well what is happening physically. We could then use the 
curve to read off values for 2* which come between the points which we have plotted. (It's 
worth mentioning very briefly here that if the process of doubling is not smooth, so that it 
goes in definite steps like the numbers of people involved in a game which starts with one 
person picking a partner, and then both these people picking partners and so on, then the 
mathematical description of what is going on will be very different. We shall look at this 
situation in Section 6.C. Then, later on in Section 8.B.(a), we look at what happens if you 
start with stepped time intervals, but then make these intervals smaller and smaller, so that 
you are getting closer and closer to a continuous process - something which is at the heart 
of the maths of the physical world.) 
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Now try answering the following questions yourself. 

(1) How many cells will there be after 5 hours? 

(2) How many cells are there after lj hours? 

(3) How long is it until there are 16 000 cells? 

(4) How long is it until there are 64 000 cells? 



As you answer these four questions, you will probably guess what I'm working towards 
here. The answers go as follows. 

(1) There will be 32 000 cells after 5 hours (that is, 1000 X 2 5 ). 

(2) After \\ hours there will be approximately 2828 cells (that is, 1000 X 2 3/2 ), using 
a calculator for 2 3/2 and giving the answer to the nearest whole number. 

(3) It takes 4 hours to get 16 000 cells because 1000 X 2 4 = 16 000. 

(4) It takes 6 hours to get 64 000 cells because 1000 X 2 6 = 64 000. 

The last two questions are put the other way round from the first two so that, to find the 
answers, you have to go back from a known x to find the t which gave it. In other words, you 
are using the inverse function of x = 2'. 

So what is this inverse function that you are using? 

The answer to this question is so important that it needs a section of its own. 

3.C.(b) The inverse of a growth function: log functions 

This inverse function has to describe 16 = 2 4 giving us the power 4, and 64 = 2 6 giving us 
the power 6. It is the inverse function of x = 2' and we call it log to the base 2. 



If x = f(t) = 2* then /- 1 (?) = log 2 t. 

Because any function and its inverse also work opposite ways round, it is also true 
that if/" 1 (0 = log 2 t then /(0 = 2'. 



I show a sketch of x = 2' and its inverse function of x = log 2 t in Figure 3.C.2. 




Figure 3.C.2 
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We know that these curves work well for giving a description of what is happening 
physically. We can't therefore allow negative roots here, since these would give us points 
which would not lie on the curve of x = 2'. (For example, we don't want x = — y/2 when t = 2 .) 
For this reason we only include positive roots, meaning that our inverse function is safe. 

This means that we can only have logs of positive numbers. 

3.C.(c) Finding the logs of some particular numbers 

Many students find logs rather alarming. They are so important in applications that it's 
important for you not to be scared of them, so now we will look at some particular examples 
of how they actually work. 

We have already seen the particular cases of log 2 (2 4 ) = 4 and log 2 (2 6 ) = 6 from the 
answers to questions (3) and (4) in the previous section. 

We can say that if some number n = 2* then t = log 2 n. 

This means that if we can write any particular number as a power of 2 then it is very easy 
to write down its log to base 2. Here are two examples. 

(1) 128 = 2 7 so log 2 (128) = 7 and (2) 1/8 = 1/2 3 = 2~ 3 so log 2 (1/8) = -3. 

exercise 3.C.1 Some of the questions in this exercise use the special results for powers from 

Section i.D.(b) - you may need to go back to these before you do them. 

(1) Try finding the logs to base 2 of the following yourself, 
(a) 4 (b) 8 (c) 2 (d) 1 (e) \ (f) \ 

(2) Logs to other bases work in exactly the same sort of way. 
For example, 27 = 3 3 so log 3 27 = 3. 

Try finding the logs to base 3 of the following numbers yourself, 
(a) 9 (b) 81 (c) £ (d) \ (e) 1 (f) 3 (g) \ (h) 27 (i) V3 

(3) Now try finding the logs to base 10 of these numbers, 
(a) 100 (b) 1000 (c) 10 (d) 1 (e) ^ (f) 0.01 

Some important points come out of the answers to this exercise. This is the first. 



It is always true that log„ a = 1 and log a 1 = 0 for any base a. 



We'll also widen the definition of logs to a general base, here. 



If x = a' then t = log a x and if t = log a x then x = a*. 



Also, logs to base 10 are given on your calculator, because we count in base 10. This 
means that you can get the same answers to question (3) above by using your calculator - 
do this, just to check. You will need to use the key marked 'lg' or 'log'. (The one marked 
'In' or 'log e ' will give you a different sort of log which I'll come to in Section 3.C.(e).) 
Because logs to base 10 are so common, we don't usually bother to write the little 10 below. 
Your calculator will also give you values for all those in-between points on the smooth curve 
of x = log 10 t where we can't work out the answers in the way we've done the ones above. 
We can't explain mathematically how it does this yet. 
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3.C.(d) The three laws or rules for logs 

In Section l.D.(a) we wrote down the three rules for working with powers. These are as 
follows: 



Rule (1) a m X a" = a 
Rule (2) a m + a" = a 
Rule (3) (a m ) n = a mn 



We showed there that they worked for whole number powers, and said that they do, in 
fact, work for any values of m and n provided that a ^ 0. We can't yet show that this is true 
though at least now we have a mental picture of the graph of x = a' to give us some idea of 
how the intermediate values work. Our next results come from assuming that the three laws 
above are indeed true. 

The special striking property of these three laws of powers is that they make things easier. 
They write a multiplication in the form of an addition, a division in the form of a subtraction, 
and raising to a power in the form of a multiplication. 

Because logs are the inverses of powers, they also have this property of making things 
nicer. Through the three rules for powers, we get the three rules for logs which I have put 
in a box below. 



The three rules for working with logs 
Rule (1) log a (xy) = log a x + log a y 

Rule (2) log a \ - j = log a x - \og a y 

Rule (3) log a (x") = n \og a x 



I will show you through a numerical example how the first rule for logs comes from the 
first rule for powers. 

Suppose we have log 3 (9 X 81). 

Then Rule (1) says that log 3 (9 X 81) = log 3 9 + log 3 81. 

Can we show by using the first rule of powers that the LHS is equal to the RHS above? 
We know that 9 = 3 2 and 81 = 3 4 so we can say that log 3 9 = log 3 (3 2 ) = 2 and 
log 3 81=log 3 (3 4 ) = 4. 

Therefore the RHS = log 3 9 + log 3 81=2 + 4 = 6. 

We can also say that the LHS = log 3 (9X81) = log 3 (3 2 X 3 4 ) = log 3 (3 2+4 ) = 2 + 4 = 6. 
Therefore we have shown that the RHS is equal to the LHS. 

In exactly the same way, suppose we have log a (xy) and we rewrite each of x and y as 
powers of a, so that x = a m and y = a". 

This then means that m = log a x and n = log a y. Then 

log a (xy) = log„ (a m a") = log a (a m + ") (from the first rule) 

= m + n = log a x + log a y. 
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^^^^ We can see from what we have just done that it cannot be true that log,, (x + y) 
^H-^V = log,, x + log,, y (except for the very special case when xy = x + y). 

We can show similarly that log„ (x/y) = log a x - log a y. 
Again, we start by looking at a numerical example. 
Can you show that log 2 (32/4) = log 2 32 - log 2 4? 



We can say that log 2 (32/4) = log 2 (2 5 /2 2 ) = log 2 (2 5 ~ 2 ) = 5-2 = 3. 
Also log 2 32 - log 2 4 = log 2 2 5 - log 2 2 2 = 5 - 2 = 3. 
Therefore the LHS above is equal to the RHS. 

Now we show in a more general way that 




= log a x - log a y. 



We rewrite x as a m and y as a" as we did before. Then log a x = m and log a y = n. So 

log, j = log fl j = log„ (a m -") (from Rule (2)) 

= m-n = log a x - log a y. 
Finally, we look at log a x". 

Taking a numerical example first, can you show that log 2 (8 4 ) = 41og 2 8? 



You can say that 8 4 = (2 3 ) 4 = 2 12 from Rule (3), so log 2 8 4 = log 2 2 12 = 12. 
Also, log 2 8 = log 2 2 3 = 3, so 4 log 2 8 = 4 X 3 = 12. 
Therefore, log 2 (8 4 ) = 4 log 2 8. 

We now show in a more general way that 

log a x n = n \og a x. 

We rewrite x as a m , so m = log a x. 

Now, we have log a (a m ) n = log a (a mn ) (from Rule (3)) = mn = n\og a x. 

A little piece of history 

Before calculators were invented, the multiplication and particularly the division of large 
numbers were very tedious and time-consuming processes. However, it was realised that if 
the numbers could be written as powers of 10, the processes could be converted into addition 
instead of multiplication, and, even better, subtraction instead of division. Books with tables 
of these corresponding powers were published, to use for these calculations. 

You can relive the experience of past days by using logs to divide 231.4 by 27.2. 

First, find the logs of the two numbers on your calculator, then subtract the second from 
the first, and finally do INV log or SHIFT log. You get the result 8.5074 to 4 d.p., an answer 
which you, of course, can obtain far more quickly by simply feeding in the original numbers 
and pressing the = button. Back in those days, finding the logs from log tables and then 
subtracting them was vastly preferable to the alternative of long division. Calculators are a 
great blessing for those faced with complicated arithmetic. 
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For you, the three rules or laws of logs will be of great importance when you are solving 
physical problems. They can be used either for splitting expressions up or for combining 
separate logs together. Being able to rearrange in both directions is important so I will give 
two examples of each. 

In the first two, we split up as far as possible. 

example (1) log 2 8x 2 = log 2 8 + log 2 x 2 = log 2 2 3 + 2 log 2 x = 3 + 2 log 2 x. 

example (2) log 2 (3x 2 /y 3 ) = log 2 (3x 2 ) - log 2 (y 3 ) = log 2 3 + log 2 x 2 - log 2 y 3 

= log 2 3 + 2 log 2 x - 3 log 2 y. 

In the second two examples, we combine as far as possible. 

example (3) log 2 3 + 4 log 2 x = log 2 3 + log 2 x 4 = log 2 (3x 4 ). 

/x 2 +l\ 

EXAMPLE (4) log 10 (X 2 + 1) - log 10 (x 2 - 1) = log 10 I — " I. 

You can't split the insides of the brackets here! 

exercise 3.C.2 (i) Use the rules of logs to split the following expressions up into separate logs 

(or numbers) as much as possible, 
(a) log 3 3x (b) log 3 27X 2 

(c) log 3 (x/y) (d) log 3 (x 2 /a 2 ) 

(e) log 3 {ax") (f) log 3 (9o x ) (g) log 3 (2x + 3/) 

(2) Combine the logs in the following as far as possible, using the laws of logs, 
(a) log 1D x + log 1G (x - 1) (b) 2 log 1Q x - log 10 y 

(c) log 1D (x + 1) - log 1G (x - 1) (d) 3 log 10 x + 2 log 10 y 

3.C.(e) What are 'e' and 'exp'? A brief introduction 

In the physical example of cell growth in Section 3.C.(a), the number of cells present at any 
particular time t was given by the equation x = 2*. Also, the rate of increase of this number 
of cells was directly proportional to the number of cells present at any particular time. Using 
the ideas of Section 3.A.(a), we could say that 

the rate of increase = k (the number of cells present) 

where k is some constant. (We aren't yet in a position to work out the value of this constant 
- this has to wait until Section 8.F.(d).) 

The special and particular property of the number e is that the rate of growth at any 
instant of a quantity x given by x = e' is actually equal to x itself. The constant of 
proportionality, k, is equal to 1, which greatly simplifies many situations. We can't go into 
what this will mean mathematically until Section 8.B, but because functions involving e are 
of central importance in describing many physical processes, you are likely to meet them 
early on in your course. This is why I'm putting in this brief introduction for you here. 

The value of e lies between 2 and 3, and its value to 3 d.p. is 2.718. (It is a number like 
71 which cannot be written with an exact numerical value.) 

The curve of x = e* lies between the curves of x = 2* and x = V. I show this in 
Figure 3.C.3. 
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Figure 3.C.3 



Notice that all the curves pass through the point (0,1), because 2 0 = e° = 3° = 1. 

You may sometimes see e* written as exp(f). (The 'exp' is short for 'exponent'.) This 
notation is particularly useful if you have a complicated power of e because it makes it much 
easier to read than the tiny writing of a power. 

^^^^ The word 'exp' is also sometimes used by calculators when they display very 
^B-^V large or very small numbers in scientific notation. For example, 314 000 
might be displayed as 3.14 EXP 5, meaning 314 000 = 3.14 X 10 5 , or 
0.00176 might be displayed as 1.76 EXP -3, meaning 0.00176 = 1.76 X 10~ 3 . 

When 'exp' is used like this, it is referring to powers of 10 not e. 



Calculators also sometimes use a gap instead of putting 'exp' when they are displaying 
numbers in scientific notation. They may also write the power of 1 0 raised above the level 
of the number. It is important for you to know how your own calculator does this. If you are 
at all unsure, put in (600 000) 2 . This is 3.6 X 10 11 in scientific notation, and you will be able 
to see just how your calculator displays the 3.6 and the 11. (Your calculator will display this 
number in this way because it is too large for the conventional display.) 

Logs to base e are written as 'In' or Tog e '. They are often shown as 'In' on calculators. 
Because the behaviour of e* and therefore of In t is so special, these logs are often called 
natural logs. We can say 



if x = 


e* then t = In x 


and 




if t = 


In x then x = e'. 
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One example of how e creeps into physical laws is given by the value of the constant k 
which we referred to at the beginning of this section. We shall show in Section 8.F.(d) that 
k = In 2. 

I show a sketch of x = e* and its inverse function of x = In t in Figure 3.C.4. 



A3C 




= *n t 



Figure 3.C.4 



If you plot the curve of y = e x as accurately as possible on graph paper, 
taking values of x between 0 and 4 inclusive, you will be able to see more 
clearly how the curve builds up. (You can fill in as many intermediate 
points as you wish, using the e x button on your calculator. The curve of 
y = e x is exactly the same as that for x = e\ We are just using different 
letters.) 

You will see that the steepness of the curve is changing smoothly as the 
value of x increases. Clearly this is a very different situation from the 
graphs of straight lines where the steepness, or rate of change of y with 
respect to x, remains the same, and they have a constant gradient. 

Can you think of a way of estimating the steepness or rate of change of 
the curve of y = e x when x = 1.5, by drawing in a straight line and 
finding its gradient? (If you choose different scales on the two axes, be 
careful to allow for this when you find the gradient of the line.) 

What answer do you expect to get? 



3.C.(f) Negative exponential functions - describing population decay 

The situations represented by the graphs of x = 2* and x = e* are examples of what is called 
exponential growth. 

What would the graphs of x = 2~' or x = e~' represent? 
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I show some values for x = 2 1 in the table below. 



t 


-3 


-2 


-1 


0 


1 


2 


3 


4 


X 


8 


4 


2 


1 


i 

2 


l 

4 


i 

8 


1 

16 



You will see that the values match those of the table on page 1 14 except that they have 
been switched either side of t = 0. 

I have drawn a sketch of the graphs of x = 2~' and x = 2' together on the same axes in 
Figure 3. C. 5(a). This shows that they are mirror images of each other in the vertical axis. 

In Figure 3. C. 5(b), I have sketched the two graphs of x = e 1 and x = e~\ These, like all 
similar pairs of equations, also form a pair of mirror images of each other in the vertical 
axis. These mirror images will always intersect each other at the point (0,1) since a 0 = 1 for 
all non-zero values of a. 




Figure 3.C.5 

^^^^ Don't confuse the graph of x = with the graph of x = - e l . The second of 
^B-^V these is the same as the graph of x = e T except that every value of x has now 

become negative. Therefore it is the same as the graph of x = e' reflected in 

the horizontal axis. 



The graph of x = 2~' could represent the radioactive decay of 1 tonne of a substance with 
a half-life of one hour. (This means that during each hour the mass of the substance becomes 
half what it was at the beginning of that hour. The total mass of substance present will 
probably not change very much since most radioactive elements decay into another element 
with a very similar mass.) The left-hand side of the graph then shows the mass of the 
substance present at various times before the instant when we started measuring. These times 
therefore have negative values. 
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This graph represents what is called the exponential decay of the substance. 
We shall look at this kind of situation in more detail in the first example in Section 
9.C.(b). 

3.D Unveiling secrets - logs and linear forms 

The use of logs gives us an extremely powerful method for analysing experimental 
results to reveal underlying physical laws of relationship. This section describes how this 
works. There are some practical applications of these methods to physical examples in 
Section 9.C.(b), where we look at how we can solve some equations involving rates of 
change. 

3.D.(a) Relationships of the form y = ax" 

Suppose that we have a table of pairs of experimental measurements x and y, and we suspect 
that there is a relationship between x and y of the form y = ax", where a and n are two 
constants which we want to find out. 

If our suspicion is correct, and we plot the points given by the pairs on graph paper, we 
will find that they appear to lie on or close to a curve similar to the sketch I have shown in 
Figure 3.D.1 (unless n = 1 when we will have the straight line y = ax). 



\3 




Figure 3.D.1 



But this curve will take us no further forward since we can't see from it what its equation 
is, and so we can't find out from it what a and n are. 

However, we know that we can get information from a straight line. If we have a straight 
line with the equation y = mx + c then m is the gradient of the line, and c is its y intercept. 
(Look in Sections 2.B.(d) and (e) if necessary.) 

If we can somehow convert the curve into a straight line, we shall be able to read useful 
information from it. 

How can we do this? 



We can take logs of both sides of the equation y = ax". We do this usually either to base 
10, or by finding natural logs (i.e. to base e), since these are the two possibilities given on 
calculators. In my example, I use logs to base 10. 

Then we use the laws of logs to write this new equation in a simpler form. 
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These three laws or rules of logs come in Section 3.C.(d). As we shall be using them a 
lot here, I have put them in again for you. 



The three laws of logs 

\o%(ab) = log a + log b 
\o%(a/b) = log a - log b 
log(a") = n log a 



To fit any of these laws, all the logs involved must be taken to the same base. 
^B^B Remember that log(a + b) is not equal to log a + log b. 

™ 

If we take logs on both sides of the equation y = ax", we get 

log y = log(ax") = log a + log(x") = n log x + log a. 

Now we compare this with the equation of a straight line, Y = mX + c. 
I've put this in a box for you as it is important. 



Finding a linear form for y = ax" 

Taking logs gives log y = n log x + log a. 

Comparing this with Y = mX + c gives 

Y = log y, X= log x, m = n and c = log a. 



So we can now see that if the physical relationship is of the form y = ax" then we should 
get an approximate straight line if we plot log y against log x. (I say 'approximate' because 
if these are experimental values there is likely to be some error in the measurements.) 

Drawing a line of best fit through the points will give us something similar to the 
sketch I have shown in Figure 3.D.2(a). The reason for drawing this line of best fit is that 
it evens out the inaccuracies as much as possible since it uses all the data that we have. 
Trying to calculate an equation from just two of the pairs of values which we found from 
taking the logs would be less accurate. Sometimes you may draw this line in by eye, or 
in some cases you may do the job more accurately by finding a regression line, in which 
case you will be able to write down the values for c = log a and m = n immediately from 
its equation. 

If you have drawn a line of best fit by eye, you will now have to use it to find your c and 
m, so I will explain to you next how you would do this from your graph. 
This graph will look similar to my drawing of Figure 3.D.2(a). 

In Figure 3.D.2(b), I show a sketch on which I have put some numerical values, so that 
I can more easily explain to you the process for the next stage. 
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Figure 3.D.2 



Firstly, we use the graph to find the value of c. This is given by reading off the value of 
the ^intercept. This gives us c = log a in (a) and c = log a = 1.8 in the numerical example 
of (b), so a = 63 to 2 s.f. 

Secondly, because we now have a straight line, we can find its gradient by using any two 
points lying on the line. (This is explained in Section 2.B.(d).) Because this is a line of best fit, 
it may be that neither of these points corresponds to an actual pair of plotted measurements. 

The gradient is given by PRIQR in (a), and 2.4/0.8 = 3 giving n = 3 in (b). 
The graph of Figure 3.D.2(b) would give us the result that the pairs of measurements x and 
y are linked by the relationship y = 63x 3 . 

^^^^ Remember that you must take account of the scales that you have used on 
^H-^V your horizontal and vertical axes when you work out the gradient of your 
line. You can't do it simply from the graph paper squares. 



Dealing with a possibly tricky situation 

In order to make the best use of the pairs of measurements that you have, it is often better 
to use only the parts of the scales which cover the range of your measurements, rather than 
showing the entire scale from zero at the origin. The convention for showing that you have 
done this is to use a zig-zag at the origin as I have done on my X-axis in Figure 3.D.3. 




Figure 3.D.3 
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It's quite easy to find the gradient of the line here, as it is 
2.1 - 1.5 1 
10.52 - 8.12 4' 

^^^^ The tricky bit is finding the ^intercept correctly. It isn '( 1 .2 because of the 
^H-^V break in the x-axis which means that it is not true that Y = 1.2 when X = 0. 

But, since we now know that the gradient of the line is \, we know that its 
equation is Y = \x + c. 

We also know that 7= 1.5 whenX= 8.12, so c = 1.5 - 2.03 = -0.53. 

But c = log a so a = 0.295 and we have the equation linking the 
measurements as y = 0.295x 1/4 . 



3.D.(b) Relationships of the form y = an x 

Suppose we have a table of pairs of experimental measurements x and y, and this time we 
suspect there is a relationship between them of the form y = an x where, as before, a and n 
are two constants for which we want to find the value. 

Just like last time, if this relationship is true, plotting y against x will give us a curve from 
which we can obtain no further information except that there does seem to be some form of 
relationship. 

Try taking logs of both sides of the equation y = ax" yourself, and see if you can 
work out what we should make X and Y be so that we get a straight line when we plot 
Y against X. 



Taking logs of both sides of the equation y = an x , you should have 

log y = log(a« x ) = log a + log(« x ) = x log n + log a. 

I've put the next part of the working in a box for you, so that it is easy to refer to when you 
need it. This is what you should have found. 



Finding a linear form for y = an x 

Taking logs gives log y = x log n + log a. 

Comparing this with Y = mX + c gives 

Y = log y, m = log n, X = x and c = log a. 



Therefore, plotting Y = log y against X = x should give us a straight line if our suspicion 
is correct. 

Doing this will give us a sketch similar to Figure 3.D.4(a). 
Again, I have shown a numerical example in Figure 3.D.4(b). 
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Figure 3.D.4 




From Figure 3.D.4(a) we have c = log a and m = log n = PRIRQ. 
From Figure 3.D.4(b) we have log a = 2.3 so a = 200 to 2 s.f. and m = log n = <, 
so «= 1.3 to 2 s.f. 

This would mean the original relationship in this case was y = 200(1. 3 X ). 

If we do not know which of these forms the relationship has, then it would be sensible 
to try both log y against log x, and log y against x, in the hope of getting a straight line. 

It is possible to do this by using special log/linear or log/log graph paper, which saves you 
having to do the logging yourself. 

The log scales are in powers of 10 called cycles, so you would choose the number of 
cycles according to the range of measurements you need to cover. For example, if this range 
runs from 27 to 1540, then you would need the three cycles 10-100, 100-1000 and 
1000-10 000. 

3.D.(c) What can we do if logs are no help? 

Unfortunately, it isn't possible to bring all relationships to a linear form by taking logs both 
sides. 

For example, if we suspect a relationship of the form y = a + bx 2 , taking logs both sides 
does not help us since log(a + bx 2 ) cannot be split up, and so the values of a and b will 
remain hidden inside the log. 

^^^^ It isn't true that log (a + bx 2 ) is the same as log a + log(/?.v 2 ). 

^■^^ If you think this should be true, go quickly back to Section 3.C.(d) and sort 
out these risky ideas. 



All is not lost in the search for the values of a and b. 

If you compare y = a + bx 2 with Y = mX + c, what could you choose for Y and X for the 
points to lie on a straight line? 

How would you then find the values of a and b from this straight line? 
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Plotting Y = y against X = x 2 will give a straight line if the relationship is y = a + bx 2 . 
In this case, a is the y intercept, and b is the gradient of this line. 

This may seem surprising so I will show you that it works by taking the example of 
7 = 3 + 2x 2 (which you will recognise gives the left-hand sketch of Figure 3.D.5(a)). 

Plotting y against x 2 from the table of values in Figure 3.D.5(b) gives the straight line 
shown in Figure 3.D.5(c). 




This straight line has aj intercept of 3 and its gradient is (1 1 - 3)/4 = 2, so a = 3 and 6 = 2, 
giving us the equation we know we should have of y = 3 + 2x 2 . 

If you suspected a relationship of the form (I) y = a + bx 3 or (2) y = a + b^x what would 
you plot in each case in order to get a straight line if your theory is correct? 



For (1), you would try plotting values of y against values of x 3 . 

For (2), you would try plotting values of y against values of yfx. 

You will see that the problem we have here is that, in order to get the straight line, we 
need to know what power of x is involved. In the first example which we looked at, the logs 
took care of that problem for us. 
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4 Some trigonometry and geometry 
of triangles and circles 

This chapter reminds you of what trig is for, and how it works in triangles. It also 
explains some of the special geometrical properties of triangles and circles, because 
they may be very useful to you in applications of maths to your own special subject 
area. 

The chapter is divided into the following sections. 

4.A Trigonometry in right-angled triangles 

(a) Why use trig ratios? (b) Pythagoras' Theorem, 

(c) General properties of triangles, (d) Triangles with particular shapes, 

(e) Congruent triangles - what are they, and when? 

(f) Matching ratios given by parallel lines, 

(g) Special cases - the sin, cos and tan of 30 0 , 45 0 and 6o°, 

(h) Special relations of sin, cos and tan 

4.B Widening the field in trigonometry 

(a) The Sine Rule for any triangle, (b) Another area formula for triangles, 
(c) The Cosine Rule for any triangle 

4.C Circles 

(a) The parts of a circle, (b) Special properties of chords and tangents of circles, 

(c) Special properties of angles in circles, 

(d) Finding and working with the equations which give circles, 

(e) Circles and straight lines - the different possibilities, 

(f) Finding the equations of tangents to circles 

4.D Using radians 

(a) Measuring angles in radians, 

(b) Finding the perimeter and area of a sector of a circle, 

(c) Finding the area of a segment of a circle, 

(d) What do we do if the angle is given in degrees? 

(e) Very small angles in radians - why we like them 

4.E Tidying up - some thinking points returned to 

(a) The sum of interior and exterior angles of polygons, 

(b) Can we draw circles round all triangles and quadrilaterals? 



4.A Trigonometry in right-angled triangles 

4.A.(a) Why use trig ratios? 

When you began learning trigonometry (often referred to as 'trig'), you will have started by 
working with right-angled triangles. Since my policy is to make sure of the groundwork for 
each topic before going further, I will start from here, too. 

We begin by looking at the right-angled triangle ABC shown in Figure 4.A.I. 
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A 

Figure 4.A.1 



c 



We will describe the sides of this triangle by their position relative to the angle at^. 
BC is the side opposite to angle A (opp. for short). 
AC is the side adjacent to angled (adj. for short). 
(The word 'adjacent' means 'lying next to'). 

AB is the longest side, opposite to the right angle. It is called the hypotenuse (hyp. for 
short). 

Then we give particular names to each of the ratios of the different pairs of sides. We say: 

BC opp. AC adj. BC opp. 

sin A = = , cos A = = , tan A = = . 

AB hyp. AB hyp. AC adj. 

To do the thing thoroughly, the ratios obtained by turning the above three ratios upside 
down are also given names as follows: 

1 AB I AB I AC 

= = cosec^, = — = secA, = = cot^. 

sin ,4 BC cos A AC tan A BC 

These three ratios are the reciprocals of the first three ratios. 

(Sin, cos, tan, cosec, sec and cot are all shortened versions of longer names which are 
relatively rarely used. They are, in the same order, sine, cosine, tangent, cosecant, secant and 
cotangent.) 

The question now is why did anyone think these different ratios so important that they 
ought to be given special names? We can see the answer to this by looking at the triangles 
in Figure 4.A.2 which are nested into each other because they are the same shape. Only their 




A 4 C E G- 

Figure 4.A.2 
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size is different. Triangles ADE and AFG are enlargements of triangle ABC. It is as though 
triangle ABC is stretched out into these larger triangles under a constant pull, so that all the 
proportions stay the same. (If it is some time since you did any trig, you may find that it 
helps you to draw in the outlines of the three triangles in three different colours.) 

From the lengths shown on the triangles, how long will the sides AE, AD, AG and 
AF be? 



Triangle ADE has sides which are all twice as long as triangle ABC, since it is just a 
scaled-up version of triangle ABC. So AE = 8 and AD = 1 0 units long. 

Similarly, triangle AFG is scaled up by a factor of 4, so AG = 1 6 units and AF = 20 units 
long. 

Next, we write down the values of sin A, cos A and tan A in these three triangles. 
I have left some blank for you to fill in because you will then see why they are so 
important. 

In A ABC, 

3 4 3 

sin A = — , cos A = — , tan A = — . 

5 5 4 

In A ADE, 

6 3 8 3 
sin A = — = — , cos A = — = — , tan A = — = — . 

10 5 5 8 

In A AFG, 

12 3 

sin A = — = — , cos A = — = — , tan A = — = — . 
20 5 



We see that the fractions or ratios giving the sin, cos and tan of angled remain the same, 
although the sizes of the triangles are different. It is this property of remaining constant for 
a given angle, whatever the scale of the triangle that the angle is in, which makes these 
ratios so important. 

Practically, it makes it possible to find heights or depths in situations where we can't 
make these measurements directly. For example, if we wish to find the height of a tree, it can 
be done by measuring the distance to the foot of the tree, and the angle of elevation E to the 
top of the tree. We can then use the tan of this angle of elevation to find its height. 




< XO m > 

Figure 4.A.3 
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In the case shown in Figure 4.A.3 we would have: 



H 

tan 38° = — so H = 20 tan 38° = 15.6m to 1 d.p. 
20 



There are two standard ways of measuring angles. They can be measured in 
degrees, where 90° is a right angle, as shown in Figure 4.A.4 below. Then 
180° is a straight line, and 360° is a full turn. 

<?0° 



ISO 



f 


> 




J 



360° 



Figure 4.A.4 



Angles can also be measured in radians which are described later on in 
this chapter in Section 4.D.(a). 

There is a third way of measuring angles on your calculator (called grad), 
which is very rarely used. 

The ratios for any sin, cos or tan are programmed into your calculator so 
that you can then use them to find either unknown angles, or the lengths of 
unknown sides of triangles. 



Here's a quick revision of how the working out goes, just in case you haven't used it for 
some time. 

example (l) Find the length of PR in triangle PQR, in which the length of QR = 
5 cm and the angle P is 32°. I show a sketch of this in Figure 4.A.5. 

R 
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example (2) Find angle b in triangle ABC in Figure 4.A.6, if AB = 7m and 
BC = 4 m. 




Figure 4.A.6 

We have cos b = 4/7 so b = 55.2° to 1 d.p. (using INV cos or 
SHIFT cos or 2nd/F cos on the calculator to find the angle with the 
known cos). This angle is cos -1 (4/7), where cos -1 stands for 'the angle 
whose cos is'. (We shall look at this in more detail in Section 5.A.(g).) 

exercise 4.A.1 For completeness, I have included this exercise on finding angles and lengths of 

sides in right-angled triangles. If you are at all unsure that you remember how to 
do these, this exercise gives you something to check against. 

(A) If the sketches in Figure 4.A.7 all show triangles with lengths given in 
centimetres find the lengths of the sides marked with a letter to 2 d.p. 




Figure 4.A.7 

(B) Find the marked angles in these triangles giving your answers in degrees to 
one decimal place (Figure 4.A.8). 




Figure 4.A.8 
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Comparing the areas of the triangles in Figure 4.A.2 

Returning to the three nested triangles of Figure 4.A.2, we know that the lengths of the 
matching sides go in the ratio of 1 : 2 : 4 as we move from the smallest triangle to the largest 
triangle. 

How do their areas compare? Do they also go 1:2:4? 




Figure 4.A.9 



Each triangle is half a rectangle as you can see from Figure 4.A.9. Using A to stand for 
'triangle', we have 

A ABC = 5X4x3 = 6 square units, 

A ADE = 5X8X6 = 24 square units, 

A AFG = 5X16x12 = 96 square units. 

The ratio of the areas is given by 



A ABC : A ADE : A AFG = 6 


24 


96 


= 1 


4 


16 


= l 2 


2 2 


42 



The ratio of the areas is the same as the ratio of the lengths squared which makes sense 
as the area is found from multiplying two lengths together. So, for example, if each length 
has been doubled the area will be four times larger. 

4.A.(b) Pythagoras' Theorem 

You will almost certainly have recognised the smallest triangle in Figure 4.A.2 as having 
sides of the smallest whole numbers which fit Pythagoras' Theorem. This says that the 
square on the longest side (or hypotenuse) of a right-angled triangle is equal to the sum of 
the squares on the other two sides. 

(In this particular case, we have 5 2 = 3 2 + 4 2 .) 

The ancient Egyptians knew that they could use a 3, 4, 5 triangle to give them a square 
corner to true their buildings. 
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We can see that Pythagoras' Theorem must be true for any right-angled triangle from the 
pair of drawings in Figure 4.A.10. 




Figure 4.A.10 



This beautiful visual proof was first given in an old Chinese text. 

It is based on the symmetry of the four triangles all sitting on the sides of the square on 
their longest sides so that together they form a larger square. The larger square is then 
rearranged to give the same four triangles and the two squares on each of the shorter 
sides. 

A similar proof by rearrangement was given by the twelfth-century Hindu mathema- 
tician, Bhoskara. Underneath his drawing he wrote the single word 'Behold!'. 

Two other examples of right-angled triangles in which the sides are whole numbers are 
given by 5, 12 and 13 units, and 8, 15 and 17 units, because 5 2 + 12 2 = 13 2 and 8 2 + 15 2 
= 17 2 . 

Sets of three whole numbers like these are called Pythagorean triples, and there are, in 
fact, infinitely many of them. In the huge majority of cases, however, the sides of right- 
angled triangles are not all exact numbers, and therefore involve those irrational numbers 
like ^1 which caused Pythagoras such distress. (See Section l.E.(d).) 

Pythagoras' Theorem can be used to calculate the length of the third side of any right- 
angled triangle if we know the other two. 

Here are two examples. 

In each of the two triangles in Figure 4.A.1 1 find the length of the third side. 




Figure 4.A.11 

In (a), h 2 = l 2 + 24 2 = 49 + 576 = 625 so h = 25 units. 

In (b), 10 2 = y 2 + I 2 so 100 = y 2 + 49 and y 2 = 51 so y = 1AA to 2 d.p. 

exercise 4.A.2 Find the lengths of the third sides of each of the four triangles from Exercise 4.A.1 

part (B). 
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4.A.(c) General properties of triangles 

We have just seen that right-angled triangles have a remarkable special property. Do all 
triangles have special properties regardless of their shape? 

The most important property held in common by all triangles is that their interior angles 
always add up to 180°. 

This can be seen from the drawing shown in Figure 4.A.12. 




We start with any triangle ABC, and then draw the line CE so it is parallel to AB. (The 
two arrows on AB and CE are to show that these lines are parallel.) 

Then the two angles marked a exactly slot into each other, and so do the two angles 
marked b. 

a + b + c makes a straight line, and so adds to 180°. 
Therefore, the angles of the triangle must also add up to 180°. 

We also see from this same diagram that, if we have a triangle with one side extended, 
then the exterior angle e is equal to a + b, the sum of the two interior opposite angles. 
This is shown drawn in on Figure 4.A.13. 




4.A.(d) Triangles with particular shapes 

Triangles can come in an infinite variety of shapes, but there are two particular types which 
have specific names. 

If a triangle has two sides equal then it is called isosceles (originally by the Greeks who 
were very keen on geometry - 'iso' means 'equal' and 'sceles' means 'sides'. 
'Trigonometry' also comes from the Greeks - 'trigono' is the Greek word for triangle.) 
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The two equal sides give these triangles a line of symmetry, so that one half folds exactly 
on to the other half, and the pair of angles opposite the equal sides are also equal. The line 
of symmetry divides the triangle into two equal right-angled triangles. (See Figure 
4.A. 14(a).) The little dashes are there to mark the two equal sides. 




isosceles t <^i |ctter-a( 

Figure 4.A.14 



If a triangle has all three sides equal then it is called equilateral. Such a triangle is 
pictured in Figure 4.A. 14(b). 

It will have three lines of symmetry as shown, and will fit exactly onto itself three times 
in a complete turn. Therefore all its angles are equal, and so must be 60° each. 

All equilateral triangles can nest into each other, in any chosen corner. 

Some are shown here in Figure 4.A.15. 




Figure 4.A.15 



They are all similar to each other. ('Similar' in maths doesn't just mean 'more or less the 
same as' but 'an exact scale model of so that all the angles remain the same, and the pairs 
of sides are all in the same proportion.) 

4.A.(e) Congruent triangles - what are they, and when? 

If two triangles are exactly the same size and shape so that they can be fitted onto each other 
exactly, they are called congruent. In this case, they will obviously have three equal pairs 
of angles and three equal pairs of sides. (It may be necessary to lift one triangle out of the 
paper, and turn it over, in order to fit it exactly on top of the other one.) 

How many measurements (and which ones) do you need to know are the same in order 
to be sure that two triangles must be congruent? 

In general, three pairs of equal measurements will be enough, provided that they are the 
right pairs. See how many of these you can find - draw little sketches if necessary! (Things 
are not always what they seem.) 
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Case (1) We have already seen that having three pairs of equal angles certainly isn't 
enough. This would just mean that the triangles were similar. 

Case (2) On the other hand, having three pairs of equal sides is certainly sufficient. The 
triangles will then exactly match. 

Case (3) If we have two pairs of equal angles, then the third pair of angles must be equal 
since the angles of a triangle add to 180°. Just one pair of equal sides opposite same-sized 
angles is then enough to tell us that the scale is the same, and so the two triangles are 
congruent. 

Case (4) If we have two pairs of equal sides and one pair of equal angles, then it all 
depends where the angle is! You can see the danger in Figure 4.A.16. We are only safe if the 
angles are between the matching sides (except for one case when it doesn't matter where the 
matching pair is . . .). 




These are congruent these ana. not! 

Figure 4.A.16 



Case (5) This special case is when the two equal angles are both right angles. 

In practice, similar and congruent triangles often appear at a slant to each other. 

One example of this is shown in Figure 4.A.17 below. The two congruent triangles shown 
here, with one of them turned through 1 80° relative to the other one, fit together to form a 
parallelogram. 




Figure 4.A.17 



If the two triangles are isosceles, as shown in Figure 4. A. 18(a), then together they make 
what is called a rhombus or diamond. 




Figure 4.A.18 
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By showing the two axes of symmetry set horizontally and vertically, we see why this 
shape is called a diamond, and also that the diagonals cut at right angles. 
This is shown in Figure 4.A. 18(b). 



• What do you get if you add up all the interior angles shown in this 
drawing of a six-sided figure? (See Figure 4.A. 19(a)). Does it depend on 
its shape? 

• What is the sum of its exterior angles? (See Figure 4.A. 19(b).) 

• What would be the sum of the interior angles if the figure had n sides? 
(It would then be what is called an «-sided polygon.) 

• What would be the sum of its exterior angles? 

See if you can work out the answers yourself to these four questions. 
(I give solutions later on in the chapter for you to check against.) 




Figure 4.A.19 



4.A.(f) Matching ratios given by parallel lines 

Here is another useful property of similar triangles. 

Suppose we have two similar triangles nested into each other. 
This is shown in Figure 4.A.20. 




Figure 4.A.20 



Then BC is parallel to DE. This is shown in the diagram by using little arrows. 
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Because the triangles are similar, corresponding pairs of sides are in the same proportion, 
so we have 



AD AE DE 
AB AC BC ' 
But AD/AB = AE/AC can be written as 
AB + BD AC+ CE 



AB 



AC 



Also 



AB 



AB 



BD BD 

= 1 + 



AB 



and 



AC + CE 



Therefore 



BD 
AB 



CE 
AC 



or, equally, 



AB 
BD 



AC 

AC 
CE 



= 1 



CE 
AC' 



turning both fractions upside down if we prefer them that way. 

You will find that this property of parallel lines cutting off sections with the same ratio 
is often very useful when working with problems involving similar physical shapes. 

4.A.(g) Special cases - the sin, cos and tan of 30 0 , 45 0 and 6o° 

It is often useful to know the ratios of the sides of right-angled triangles which have 
particularly simple divisions of 90° for the other two angles. 
The two most useful ones are as follows: 

(a) the ratios for all triangles which have angles of 90°, 45° and 45°, 

(b) the ratios for all triangles which have angles of 90°, 60° and 30°. 

(a) The 90°, 45°, 45° triangle is isosceles. 

The simplest example is the one which has two equal sides of 1 unit, shown in 
Figure 4.A.21(a). 





Figure 4.A.21 



By Pythagoras, h 2 = l 2 + l 2 = 2 so h = ^2 so we have 



1 1 

sin 45° = cos 45° = — and tan45° = -=l. 

V2 1 
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(b) The 90°, 60°, 30° triangle is half of an equilateral triangle, so if we take 2 units for 
each side, the base is conveniently divided into 1 unit for each side. 
A sketch of this triangle is shown in Figure 4.A.21(b). 
Again, we can find the vertical height by using Pythagoras' Theorem. 
We have 2 2 = l 2 + y 2 so y 2 = 3 and y = yf3. This gives us 



sin 60° = cos 30° = 


V3 
2 


and 


cos 60° = 


1 

sin 30° = - 
2 


tan 60° = 


V3 


and 


tan 30° = 


1 

7T 



You will find that these exact values do also check with the decimal values given 
on your calculator for these angles. (Make sure of this for yourself.) 



4.A.(h) Special relations of sin, cos and tan 

Are there any relationships between the sin, cos and tan of the two angles a and b which will 
be true in any right-angled triangle? 

Use the triangle shown in Figure 4.A.22 below to write down the sin, cos and tan of a and 
b. Then see if you can find any connections between them. 




X 

Figure 4.A.22 



You should have found the following relationships. 

b = 90° - a because the angle sum of the triangle is 180°. 

y x ■ , j 1 

sin a = — = cos b, cos a = — = sin b, tan a 



h h x tan b 

We see also that 



sin a ylh y sin b 

= = — = tan a and = tan b. 

cos a xlh x cos b 
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We also find a very nice relationship between the sin and cos of each of a and b which 
comes directly from Pythagoras' Theorem. We have 



x + y = h so 



x 2 y _ h z 
~h 2+ ~h 2 ~~h 2 



= 1. 



But 



2 2 

y x L 

— = sin a and — = cos a 

h 2 h 2 



sin 2 a + cos 2 a = 1. 



This is an enormously useful result and it is worth surrounding its box with bright 
colour. 

It is, of course, equally true that sin 2 b + cos 2 b = 1 . Indeed all the special relationships 
which we have shown above will carry through truthfully when we move on to consider 
general angles instead of just being restricted to angles between 0° and 90°. 



0 



sin 2 a is the usual way that (sin a) 2 is written. Equally, cos 2 a means 
(cos a) 2 etc. 



6 



sin 2 a is not the same as sin(a 2 ). For example, if a = 5°, then sin a = 
0.0872 to 3 s.f. and sin 2 a = 0.00760 to 3 s.f. but sin(a 2 ) = sin 25° = 0.423 
to 3 s.f. 



The last result which we found above has two offspring which are also often very useful. 
We start with 



sin 2 a + cos 2 a = 1. 
Dividing through by cos 2 a we get 
sin 2 a cos 2 a 1 



(1) 



cos 2 a cos 2 a cos 2 a 



so 



tan a + 1 = sec a. 



(2) 



Starting again from (1), and dividing through by sin 2 a, what do you get? 
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sin 2 a cos 2 a 



sin 2 a sin 2 a 



sin 2 a 



so 



1 + cot 2 a = cosec 2 a. 



(3) 



It's also worth surrounding (2) and (3) in bright colour. 



4.B 



Widening the field in trigonometry 



4.B.(a) The Sine Rule for any triangle 

We are now in a good position to get trig formulas for any triangle, which we will then be 
able to use to find unknown angles and sides. 

We start this process by finding what is called the Sine Rule. 



A 




< a > 

Figure 4.B.1 



I have drawn a general-shaped sort of triangle in Figure 4.B.I. I have labelled the sides 
with the lower case letter corresponding to the capital letter of the opposite angle. (This is 
the usual way in which such labelling is done.) 

I've also drawn in the perpendicular line AH (so that we shall have two right-angled 
triangles to work from!). I have labelled its length h. 

Then, in AABH, 

h 

sin B = — so h = c sin B. 

c 

Write down for yourself the same sort of thing for sin C in AAHC. 



You will have 



h 

sin C = — so h = b sin C. 



So we can say c sin B = b sin C. Therefore, 
c b 



sin C sin B 

We could equally have drawn the triangle in such a way that we used A and a. 
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Therefore, by symmetry, we have 



The Sine Rule 

a b c 

sin A sin B sin C 



This applies to any triangle, and we can use it to calculate the lengths of unknown sides 
and angles. 

Here is an example of this. 

In triangle ABC, LB is 58°, ZC is 40° and the side^C is 6 m long. Calculate the lengths 
of the unknown sides and angles. 

We start by drawing a sketch. A sketch is important in any geometrical or physical 
problem, because it gives you some idea of what you are looking for. 



A 




Figure 4.B.2 

My sketch is Figure 4.B.2. I have labelled it in the same sort of way that I labelled the 
original triangle. Also, although it is not accurate, I have tried to make it believable, so that 
the angles of 58° and 40° are roughly the right size. 

So now we start. What is LAI 



It is 180° -58° -40° = 82° because the angles of a triangle add to 180° (Section 
4A.(c)). 

Now, to find a, we have 

a 6 6 

= so a = sin 82° X = 7.01 m to 2 d.p. 

sin 82° sin 58° sin 58° 

To find c, we can say 

c 6 

= so c = 4.55 m to 2 d.p. 

sin 40° sin 58° 

(It is safer not to use the newly found length of a to find c just in case it has a mistake in 
it.) 

Finally, before going on, we look at the sketch to see if our answers seem reasonably 
convincing for this particular triangle. They do, so we can proceed happily to the next thing, 
which is an exercise on using the Sine Rule. 

4.B Widening the field in trigonometry 147 



exercise 4.B.1 Find, if possible, the missing sides and angles in each of the three triangles 

whose measurements are given below, giving the angles in degrees to 1 d.p. and 
the sides in centimetres to 2 d.p. In each case, start by drawing a labelled sketch, 
as I did in the previous example. It's particularly important to do this exercise 
because things are not always quite as they seem. 

(1) Triangle ABC in which AA = 78°, AB = 65 0 and AS = 5 cm 

(2) Triangle ABC in which AC = 33 0 , BC = 6 cm and AB = 4 cm 

(3) Triangle ABC in which AC = 40 0 , AB = 9 cm and BC = 5 cm 



4.B.(b) Another area formula for triangles 

The most usual formula for the area of a triangle is 



the area of the triangle = 5 base X height. 



You can see that this must be so from Figure 4.B.3 below which shows the triangle as half 
a rectangle. 



A 




Figure 4.B.3 



Sometimes it is useful to be able to write this area in another way. We know that 
the area = \ ah 

but h = b sin C = c sin B as we saw when we proved the Sine Rule in Section 4.B.(a), above. 
So, by symmetry, 

the area = 5 ab sin C = \ ac sin B = \ be sin A. 

In words, we can say 



The area of a triangle is equal to one half of any two sides multiplied together and 
then multiplied by the sine of the angle between them. 



Here is an example of the use of this new formula. 

Find the area of the equilateral triangle ABC with sides of length 3 cm, shown in Figure 
4.B.4. 
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6 

Figure 4.B.4 



3 



Instead of having to mess around finding the vertical height, we can say that 

i 9^3 

the area = 5 X 3 X 3 X sin 60° = = 3.90 cm 2 to 2 d.p. 

4 

The new formula is particularly useful for finding the area of triangles enclosed by two 
radius lengths in circles such as the one I've shown in Figure 4.B.5. I've marked the angle 
with the Greek letter 0 (called theta), since this is often used for angles. 




Figure 4.B.5 



The area of the triangle is \ r 2 sin 0. 



4.B.(c) The Cosine Rule for any triangle 

Suppose we have a triangle in which we know the lengths of the three sides, and we want 
to find its angles, like the one in Figure 4.B.6. 




Figure 4.B.6 

4.B Widening the field in trigonometry 149 



The Sine Rule will be of no help to us here because it always involves two angles. But 
there is a formula which will help us, which is called the Cosine or Cos Rule. 

To get this, we start with a general-shaped triangle like we did with the Sine Rule, 
and label it in the same sort of way, except that this time we let the length of BH = x. 
(See Figure 4.B.7.) 



A 




Figure 4.B.7 



In triangle ABH, using Pythagoras' Theorem, we have 

c 2 = h 2 + x 2 so x 2 = c 2 -h 2 . 

What is the length of CH using the given letters? Use this to write down how Pythagoras' 
Theorem will go for A AHC. 



CH= a-x. 
So, in A AHC, we have 

b 2 = h 2 + (a- xf = h 2 + a 2 + x 2 - lax. 
But x 2 = c 2 - h 2 , so we have 

b 2 = h 2 + a 2 + c 2 - h 2 - lax = a 2 + c 2 - lax. 
In A ABH, what is cos Bl 



We have 

x 

cos B = — so x = c cos B. 

c 

Therefore, we have 

b 2 = a 2 + c 2 - lac cos B. 

Equally, by symmetry, we have the two other formulas which we could have got by labelling 
the triangle differently. 

We now have the Cosine Rule for any triangle. 
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ine Cosine Kule 




a — b + c — ibc cos a 


(!) 


b 2 = c 2 + a 2 - lac cos B 


(2) 


c 2 = a 2 + b 2 - lab cos C 


(3) 



Notice also that if we put A = 90° in (1) above, we get Pythagoras' Theorem for what is 
now a right-angled triangle. 

That is, we get a 2 = b 2 + c 2 because cos 90° = 0, so everything connects up as it should do. 

Here is an example of using the Cosine Rule to find a side of a triangle. We will use it 
to find a in AABC shown in Figure 4.B.8. 




5 <x C 

Figure 4.B.8 



This triangle is another example of a case in which the Sine Rule will not give us what 
we want. This is because the known facts slot into it in such a way that every possible 
equation has two unknowns. 

We would have 

a 10 8 

= = which is no use. 

sin 72° sin B sin C 

Using the Cosine Rule, we have a 2 = b 2 + c 2 - Ibc cos A. 

Substituting the known values, this gives us a 2 = 64 + 100 - 160 cos 72° so a = 10.7 
to 1 d.p. 

If we want to find the angles of a triangle using the Cosine Rule, it will pay us to 
rearrange the three formulas. 

For example, we have a 2 = b 2 + c 2 - Ibc cos A so Ibc cos A = b 2 + c 2 - a 2 . 
Rearranging this gives us 



b 2 + c 2 - 


a 2 




cos A = 




(1) 


Ibc 




c 2 + a 2 - 


b 2 




cos B = 




(2) 


lea 




a 2 + b 2 - 


c 2 




cos C = 




(3) 


lab 





shifting the letters round again in turn to give the other two formulas. 
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We take the triangle from the beginning of this section to show the use of the Cosine 
Rule to find its angles. It has sides of 5 cm, 7 cm and 9 cm and I show it again in 
Figure 4.B.9. 



c 




Figure 4.B.9 



We will now find the angles^, B and C. I want the angles to go in this way, which is why 
my lettering of the triangle isn't the usual one. 
Using the Cosine Rule to find /-A, we have 

b 2 + c 2_ a 2 49 + 8i _ 25 105 

cos A = = so cos A = 

Ibc 126 126 

and L A = 33.6° to 1 d.p. (Z. A = 33.56° to 2 d.p.) 
Similarly, using the Cosine Rule again to find AB we have 

c 2 + a 2 -b 2 81 + 25 - 49 57 

cos B = = = — so B = 50.7(0)° to 1 d.p. 

lea 90 90 

Working with 2 d.p. to avoid a rounding error in the first decimal place, we can find the third 
angle using the angle sum of the triangle. 

This gives us^C= 180° - 33.56 - 50.70° = 95.7° to 1 d.p. which is an angle greater than 
90°. 

Are we going to have the same problem that we had with the Sine Rule if we are dealing 
with an angle which might be greater than 90°? Will we be unsure about the shape of the 
triangle? 

If we had used the Cosine Rule to find we would have got 

a 2 + b 2 -c 2 25 + 49 - 81 7 

cos C = = = - — . 

lab 70 70 

If you now use your calculator to find L-C (putting in the fraction complete with its minus 
sign), you will find that you again get 95.7° to 1 d.p. so it agrees with what we know it 
should be. 

We find, using the Cosine Rule, that angles between 90° and 1 80° have a negative cos. 
This means that there can't be any ambiguous cases from using the Cosine Rule - we will 
know from the sign of the answer whether the angle we have found is less than 90° (acute), 
or greater than 90° (obtuse). 

We saw earlier that, if the angle A = 90°, then the Cosine Rule for angle A of 
a 2 = b 2 + c 2 - Ibc cos A becomes a 2 = b 2 + c 2 (that is Pythagoras' Theorem). 

If the angle A is acute, we are taking something off b 2 + c 2 to get a 2 . 

If the angle A is obtuse, because cos A is then negative, we are adding something on to 
b 2 + c 2 to get a 2 . 
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c c c 

The lengths of b and c are the fame In ail three triages. 

Figure 4.B.10 



You can see from the three cases which I show in Figure 4.B.10 that this must happen in 
order that the length of a will work out correctly in each case. 

If you think that the angle you are finding may be obtuse, it is safer to use the Cosine Rule 
if possible, rather than the Sine Rule. 

I shall explain exactly what we mean by the cos of an angle greater than 90° in Section 
5.A.(c). 

exercise 4.B.2 Now try the following questions. 

(1) Find the sides and angles marked with a question-mark in the three triangles 
shown in Figure 4.B.11. 




Figure 4.B.11 



(2) Figure 4.B.12 shows a triangle formed by joining together the two halves of an 
equilateral triangle by their shortest sides. 




I t 

Figure 4.B.12 



(a) How large are the angles Q and /?? 

(b) How large is AQPR? 

(c) Use the Cosine Rule in AQPR to find the cos of AQPR. 

(d) Use the Sine Rule in AQPR to find the sin of AQPR. 



4.B Widening the field in trigonometry 
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4.C Circles 

4.C.(a) The parts of a circle 

Once we start considering angles larger than 90°, we become involved with the circles which 
are used to show their turn (Figure 4.C.1). 



^0° 




Figure 4.C.1 

The convention is that angles are shown turning anticlockwise from the positive x-axis, 
so that angles from 0° to 90° lie in the quarter-circle or quadrant where all measurements 
are positive. (Bearings are not measured like this; they turn clockwise from a zero position 
at due north.) 

Because circles are intimately connected with the trigonometry of angles which are 
greater than 90°, I am including a section specially devoted to them next. 

I start with a reminder of the names of the parts of a circle which we shall need to use. 
These are shown in Figure 4.C.2 and described underneath. 




Figure 4.C.2 



• The whole curve of the circle is called the circumference. 

• Any line from the centre to the circumference is called a radius (plural: radii). Clearly, 
from the symmetry of the circle, these are all the same length. 

• A line drawn right across a circle through its centre is called a diameter. 

• A line like AB drawn across a circle is called a chord, so a diameter is a special case 
of a chord. 

• The curved piece of the circle from A to B is called an arc. The short way round from 
A to B is called the minor arc, and the long way round is called the major arc. 
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• The part of the circle enclosed between the minor arc AB and the chord AB is called 
a minor segment. The rest of the circle is a major segment. 

• The shaded piece shown in circle (c) is called a minor sector. The rest of the circle is 
called a major sector. 

To avoid mixing up segments and sectors, you can remember that 'a sector is like a piece 
of cake because it's got a "c" in it'. 

If the radius of the circle is r, then the length of the circumference is 2nr, and the area 
of the circle is itr 1 . jt is a number which cannot be written exactly as a fraction (though 22/7 
is sometimes used as an approximation to it.) To 4 d.p. it is 3.1412. As a decimal, it is non- 
repeating, and has been calculated to a huge number of decimal places using computers. 



If C stands for the circumference and A stands for the area 
C = 2jir and A = Jir 2 . 



4.C.(b) Special properties of chords and tangents of circles 

The chords and tangents of circles have special properties because any diameter of a circle 
is a line of symmetry. 

(The circle can be folded along any diameter so that the two halves exactly match.) 



The most important properties of chords and tangents 

• Any line perpendicular to a chord from the centre of the circle divides that 
chord equally in two (or bisects it). 

• If a line from the centre of a circle divides a chord equally in two then it must 
be perpendicular to that chord. 

• Any line which is perpendicular to a chord and bisects it must pass through the 
centre of the circle. 

• If a chord is pushed to the edge of a circle and extended to make a tangent (a 
line which touches the circle and gives its slope at that point), the tangent is 
perpendicular to the radius to the point of contact. 

• The two tangents to a circle from any outside point must be equal in length. 



I show examples of all these properties in Figure 4.C.3. 




Figure 4.C.3 
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The matching pairs of little marks show lines which are equal in length. 

Draw in the diameters which show the lines of symmetry in colour if it helps you. 

4.C.(c) Special properties of angles in circles 

We come next to a result which does not come so obviously from the symmetry of the circle. 

In Figure 4.C.4, 1 have shown three angles all standing on the same arc of the circle. This arc 
is drawn with a thicker line. If you measure these three angles, you will find that they are all 
equal. Any similar drawings will give other sets of equal angles. Why should this be so? 




Figure 4.C.4 



To find the answer to this, we compare the size of the angle at the centre of the circle with 
any angle at the circumference which stands on the same arc. 

We can do this in the way I have shown in the sequence of drawings in Figure 4.C.5. 




Figure 4.C.5 
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From this, we see that the angle at the centre of the circle is twice the size of the angle 
at the circumference. 

This will be true wherever this angle touches the circumference above AB, so long as it 
is standing on the same arc, so all the angles standing on this arc must be equal; an 
unexpected and beautiful result. 

If the angle is below AB, as I show in Figure 4.C.6, the angle at the circumference 
is still half the angle at the centre, but we are looking at the situation upside down, so 
the angle at the centre is now greater than 180°. (An angle like this is called a reflex 
angle.) The two angles are now standing on the major arc of the circle which I have 
shown using a thicker line. 




Figure 4.C.6 



From these two results we can now deduce a useful special case, which is that the 
angle in a semi-circle is a right angle. 

We can see that this must be so either way round from the two diagrams shown in 
Figure 4.C.7. 




Figure 4.C.7 



A summary of special properties of angles in circles 

• The angle at the centre of a circle is twice any angle standing on the same arc. 

• Angles at the circumference and standing on the same arc are equal. 

• The angle in a semi-circle is a right angle. 
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(a) Is it possible to draw a circle round any triangle as in Figure 4. C. 8(a)? 

(b) Is it possible to draw a circle round any four-sided shape (quadrilateral) 
as shown in Figure 4. C. 8(b)? 




Figure 4.C.8 



In each case, if it isn't always possible, what special conditions must you 
have in order to be able to do it? 



4.C.(d) Finding and working with the equations which give circles 

How can we find the equation of the curve which gives a particular circle in terms of 
x and yl 

We will start by considering the simplest case which is a circle of radius r sym- 
metrically placed so that its centre is at the origin. I have drawn a circle like this in Figure 
4.C.9(a). 




Figure 4.C.9 



Any point P on it, with coordinates (x, y), must be a distance r from the origin, so 
x 2 + y 2 = r 2 by Pythagoras' Theorem. 
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The equation of any circle with radius r and whose centre is the origin can be 
written in the form x 2 + y 2 = r 2 . 



For example, if the radius r is 4 units, we get the circle whose equation is x 2 + y 2 = 3 2 or 

x 2 +y 2 = 9. 

If the centre of the circle is not at the origin, we can still use the property that the distance 
of any point on the circumference from the centre is equal to the constant length of the 
radius. 

In Figure 4.C.9(b) the length of PC remains constant, and equal to r. 

If P has coordinates (x,y), using Pythagoras' Theorem here gives us (x - a) 2 + (y - b) 2 = r 2 . 



The equation of the circle with centre (a,b) and radius r is given by 

{x-af + iy- bf = r 2 . 



For example, the circle with a radius of 4 units, and with its centre at the point (6,5), has 
the equation 

(x - 6) 2 + 0 - 5) 2 = 4 2 

or x 2 - I2x + 36+y 2 - lOy + 25 = 16 

giving x 2 - I2x + y 2 - 10y + 45 = 0. 

(These numbers will fit Figure 4. C. 9(b) quite nicely. If you are at all unsure about the 
algebra version of the equation of this circle, feed in the numbers to make yourself an actual 
example of the algebra working.) 

Now we do the same thing of multiplying out with the algebra version of the equation 
given in the box above. 

We have (x - a) 2 + (y - b) 2 = r 2 . 

Multiplying out the brackets gives x 2 - lax + a 2 + y 2 - 2by + b 2 = r 2 
Tidied up, this gives us an alternative form for the equation of this circle. 



The equation of the circle with centre (a,b) and radius r can also be written as 

x 2 - lax + y 2 - 2by + c = 0 where c = a 2 + b 2 - r 2 . 



For an equation like this to give a circle it must fit the following conditions. 

(1) There must be equal coefficients of x 2 andj 2 . The coefficient is the number which 
tells us how many we've got. The coefficient of 3x 2 is 3. The coefficient of_y 2 is 
1. If there are no terms in x, say, then the coefficient of x is zero. 

(2) There must only be, at the most, terms in x 2 , y 2 , x, y and a number. 
(We mustn't have any terms with xy, for instance.) 

(3) The value of r 2 must be positive so that we have a physically possible length for 
the radius. 
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^^^^ It's easy to remember that the circle with equation x 2 - lax + y 2 - 2by + c = 0 
^H-^V has its centre at the point (a, b). But its radius is not c. 

From above, we have r 2 = a 2 + b 2 - c so r = \a 2 + b 2 - c. 

This is a very clumsy formula to remember. I think that much the best way of finding the 
centre and radius of a circle is to complete the two squares. (Completing the square is 
explained in Section 2.D.(b).) 

Here is an example of this, to show you how it works. 

Suppose we have the circle whose equation is x 2 - Ax + y 2 + 6y - 3 = 0. 

Completing the two squares gives us (x - 2) 2 - 4 + (y + 3) 2 - 9 - 3 = 0 so 
(x-2) 2 + (y + 3) 2 = 16. 

Therefore the centre of the circle is at (2, -3) and its radius is 4 units. 

^^^^ Notice that the signs flip to give the coordinates of the centre, just as they 
^B-^V do to give the solutions to quadratic equations. 



exercise 4.C.1 Find the centre and radius of each of the following circles. 

(l) (x - i) 2 + (y + 2) 2 = 16. (2) x 2 + y 2 - 2x - 4y = o. 
(3) x 2 + y 2 - 8x + 7 = o. (4) x 2 + y 2 - 6x + 2y - 6 = o. 

(5) x 2 + y 2 - x + y = 0. (6) x 2 + y 2 + 3X + iy + 1 = 0. 

(7) Find the equation of the circle which is concentric with the circle 
x 2 + y 2 + 2x - 4y = 0 and which has a radius of 5 units. 
('Concentric' means 'having the same centre as'.) 

(8) Find the equation of the circle which passes through the origin and the points 
(3,0) and (0,4), writing it in the form x 2 - 2ox + y 2 - 2by + c = 0. 

Find also its centre and radius. 

4.C.(e) Circles and straight lines - the different possibilities 

What are the three possible relationships between a straight line and a circle? Try sketching 
them for yourself. 



You should have a line which passes through the circle so that it cuts it twice, a line which 
just touches the circle and so is a tangent, and a line which misses the circle altogether. 

How will these three different possibilities show up if we work from the equations of the 
particular line and circle? 

We will go through the following example together, to see what happens. 

example (1) Find whether, and if so where, the lines 

(a) y = 2x - 4 (b) 3y = x + 11 and (c) y = 3x + 6 

cut the circle whose equation is x 2 - 4x + y 2 - 2y - 5 = 0. 

Draw a sketch showing the three lines and the circle. 
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(a) If the line y = 2x - 4 cuts the circle, the values of x and y at the points where it cuts 
must fit both the equations of the circle and of the line. (In other words, we have 
two simultaneous equations at these points, but they involve a line and a circle 
instead of two straight lines like the ones in Section 2.C.) 

This means that we can put y = 2x - 4 into the equation of the circle to find the 
possible values of x. 
This gives us 

x 2 -4x + (2x - 4) 2 - 2(2x - 4) - 5 = 0 

x 2 - 4 X + 4 X 2 - I6x + 16 - Ax + 8 - 5 = 0 

5x 2 - 24x + 19 = 0 

(5x - 19)(x - 1) = 0 

x = 1 or x = — . 

(You could use the formula for quadratic equations from Section 2.D.(d) to find 
these two roots if you prefer.) 

Substituting these values of x back in the line y = 2x - 4 gives us the 

18 

corresponding two values for y of -2 and — . 

So the line y = 2x - 4 cuts the circle at the two points with coordinates (1, -2) 
and (y, y). Sometimes, the word 'intersects' is used instead of the word 'cuts'. 

(b) To find if the line 3y = x + 11 cuts the circle, we can rewrite its equation as 
x = 3y - 1 1 and substitute this for x in the equation of the circle. 

This gives us 

(3y - ll) 2 - 4(3j - 11) + y 2 - 2y - 5 = 0 
9y 2 - 66y + 121 - I2y + 44 + y 2 - 2y - 5 = 0 

10y 2 - 80y + 160 = 0 
y 2 - 8y + 16 = 0 
(y - 4) 2 = 0. 

The two possible cutting points have come together here to give the single point 
for which y = 4 and x = 12 — 11 = 1. 

This means that the line 3y = x + 11 just touches the circle - it is a tangent 
to it. 

The point of contact has the coordinates (1,4). 

(c) This time, we put y = 3x + 6 in the equation of the circle. 

This gives us 

x 2 - 4x + (3x + 6) 2 - 2(3x + 6) - 5 = 0 

x 2 - 4x + 9x 2 + 36x + 36 - 6x - 12 - 5 = 0 

10x 2 + 26x +19 = 0. 

Using the quadratic formula on this equation, with a = 10, b = 26 and c = 19 gives 
b 2 - 4ac = -84, so we can't find any value for x which will satisfy this equation. 
This must mean that the line misses the circle completely. 
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The three different quadratic equations of (a), (b) and (c) have revealed exactly what is 
happening geometrically. 

For the sketch, we need the centre and the radius of the circle. 
We have 

x 2 - 4x + y 2 - 2y - 5 = 0 

(x - 2) 2 - 4 + 0 - l) 2 - 1 - 5 = 0 

so (x-2) 2 + (y- 1) 2 = 10. 

The centre of the circle is at the point (2,1) and its radius is /T(l 

I have drawn a sketch of the three lines and the circle in Figure 4.C.10. 




Figure 4.C.10 



I've summarised the results which we have just found in the box below for you. 



Straight lines and circles 

Substituting the equation of the line into the equation of the circle will give you a 
quadratic equation in x or y. 

There are then three possibilities. 

• The equation has two roots. This means that the line cuts the circle in two points. 

• The equation has one repeated root. This means that the line is a tangent to the 
circle - it just touches it. 

• 'b 2 - 4ac' is negative, and the equation has no real roots. 
This means that the line misses the circle altogether. 
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EXERCISE 4.C.2 



Find whether, and if so where, the lines (a) 3y = x - 5 (b) iy 
(c) y = 2x + 3 cut the circle x 2 - 6x + y 2 - iy + 5 = o. 
Draw a sketch showing the three lines and the circle. 



= x + 4 and 



4.C.(f) Finding the equations of tangents to circles 

The circle is the first curve for which we can find the steepness or gradient at any point on 
it. We saw in Section 4.C.(b) that any tangent to a circle must be perpendicular to the radius 
going to the point of contact. The gradient of the tangent will then tell us the slope or 
gradient of the circle at this point of contact. 

We will look at the following example together to see how these ideas work out in 
practice. 

example (1) Find the equations of the four tangents to the circle 

x 2 -6x + y 2 -4y- 12 = 0 

with points of contact (a) (7,5), (b) (-1, -1), (c) (8,2) and (d) (3,7). 
Draw a sketch showing the circle and these four tangents. 
We start by finding the centre and radius of the circle. 
We have 

x 2 - 6x + y 2 - 4y - 12 = 0 = (x - 3) 2 - 9 + (y - 2) 2 - 4 - 12. 

So the equation of the circle is also given by (x - 3) 2 + (y - 2) 2 = 25. 

Its centre is at the point (3,2) and its radius is 5 units. 

I have drawn a sketch of this circle in Figure 4.C.11 showing the 
first tangent that we shall find. I think that it will help you in the 
working which follows if you sketch in how you think the other three 
tangents will go. 




(a) The first tangent touches the circle at the point (7,5). 

The radius to the point of contact joins (3,2) to (7,5), so its 
gradient is 

y 2 -y\ 5-2 3 



X2 — X\ 7 — 3 



using Section 2.B.(d). 
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The tangent is perpendicular to this radius, so its gradient is -3, 
using m l m 2 = -1 from Section 2.B.(h). 

4 

It passes through the point (7,5) so its equation is y - 5 = - j (x - 7). 
(This uses y - y x = m(x -Xj) from Section 2.B.(f).) 
Tidied up, this gives 3y - 15 = - 4x + 28 or 3y + Ax = 43. 
I have shown this tangent on my sketch on the previous page. 

Try finding the other three tangents yourself. 
If curious things happen, look at the sketch and see if you can 
see why. 



This is what you should have. 

-1-2 3 

(b) The gradient of the radius which joins (3,2) to (-1, -1) is — j — - = — . 

4 

Therefore, the gradient of tangent (b) is -3. 

4 

The equation of tangent (b) is y + 1 = -5 (x + 1) or 3y + Ax + 7 = 0. 
You can sketch this tangent yourself, if you haven't already done so. 
It is parallel to the one which we found in (a). 

2-2 

(c) The gradient of the radius which joins (3,2) to (8,2) is = 0. 

8 — 3 

This gives us a real problem for finding the equation of the 
tangent by algebra but, when we look at the sketch, everything 
becomes clear. 

The gradient of this radius is zero because it is horizontal. 

Therefore the tangent at the point (8,2) is vertical and its 
equation is x = 8. 

(The x coordinate of every point on it is 8 while the y coordinate 
can be any value you choose. Excellent thinking if you got this 
equation correctly!) 

If you got stuck on this one, have another go now at answering (d). 

(d) The gradient of the radius which joins (3,2) to (3,7) is given by 

7-2 4 
3-3 0' 

This gives us even more algebraic trouble since we know we 
can't divide by zero. (Students in desperation sometimes say that 
this fraction is equal to zero but this is not true!) 

Again, looking at the sketch we see that everything falls into 
place. 

This radius is vertical and the tangent at the point (3,7) is 
horizontal. Its gradient is zero and its equation is y = 7. 

Add tangents (c) and (d) to the sketch if you haven't already 
done so. 

Because the circle is a curve for which we can find out what is 
happening with the algebra which we can do now, the example 
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above will be very useful to you when you start working with the 
slopes of general curves using implicit differentiation in Section 
8.F.(a). It will help you to see why things happen in the way that 
they do. 

exercise 4.C.3 Draw a sketch of the circle x 2 + i6x + y 2 - by - 101 = o. 

Find the equations of the four tangents to this circle with the points of contact 
(a) (4, -3), (b)(-3, 14). (cM-21,2) and (d) (-8, -11). 
Show these four tangents on your sketch. 

4.D Using radians 

4.D.(a) Measuring angles in radians 

So far, all the angles to which we have given a size have been measured in degrees. This 
form of measurement has an arbitrary element about it in that somebody originally 
decided that 90 would be a nice number of units to have in a right angle. It could equally 
well have been 100 or 80, say. Had the scale been chosen by Napoleon, it probably would 
have been 100, to fit with his other metric measurements. (Indeed, the mysterious 
gradians on your calculator are divided so that there are 100 parts to each right 
angle.) 

The special property of the radian is that it does not depend upon any arbitrary choice of 
number. It does depend on that beautiful and symmetrical shape, the circle. 
I show how in Figure 4.D.I. 




Figure 4.D.1 



If we draw an angle as shown in Figure 4.D.l(a), so that the length of the arc is equal to 
the radius, then this angle is defined to be 1 radian. 

If the arc is 2 radius lengths long, the angle is 2 radians (Figure 4.D.l(b)). 



From Figure 4.D.l(c), an angle of 0 radians gives an arc length of rO. 



(6 is the Greek letter theta and is a hot favourite for describing an unknown angle, just 
as x is for describing general unknown quantities.) 
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Since a full turn gives an arc length of the whole circumference of the circle, which is 
an arc length of 2jir, we see from Figure 4.D.l(d) that a full turn is 2n radians. 
This means that 2jt radians is the same angle as 360°. 

Remembering, too, that it is a bit bigger than 3, we have the following box of results. 



Useful rules connecting degrees and radians 

• it radians is the same angle as 180°. 

(You can think of jt as a symbol for a straight line angle.) 

• To convert degrees to radians, multiply by jr/180. 

• To convert radians to degrees, multiply by 180/jt. 

• It is useful to remember that one radian is just slightly less than 60°. 



(In practice, you very rarely have to use the conversion from degrees to radians or vice 
versa, because you will set your calculator in either degree or radian mode depending upon 
which units you want to work in.) 

Because radians come from the structure of the circle, they will slot directly into any 
working involving angles when we use calculus. If we work with degrees, however, we shall 
keep having to do a sort of gear change - and it's much nicer not having to worry about that! 
For this reason you need to be happy working with radians, so it is a good idea now to 
become familiar with the corresponding radian measurements for the standard divisions 
of 360°. 



exercise 4.D.1 Use the two circles of Figure 4.D.2 to help you to fill in the missing angles in the 

table. 




270» 27C? 
Figure 4.D.2 



Degrees 


0 






60 


90 




135 


150 


180 




240 


270 




360 


Radians 


0 


71 

6 


71 

4 




71 
2 


231 

3 








6 






ZE 
4 
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4.D.(b) Finding the perimeter and the area of a sector of a circle 

I have shown the minor sector AOB shaded in the circle with radius r in Figure 4.D.3. 




"The anjle © is in radians. 

Figure 4.D.3 



We know from the last section that the arc length AB is equal to r9. 
Therefore, the length of the perimeter of the sector AOB (that is, the distance round its 
boundary) is given by 2r + r6. 

^B^P Don't forget to include the two radius lengths here. 

™ 



The perimeter of the sector is 2r + r6. 



We can find the area of the sector AOB by thinking of it as a fraction of the area of the 
whole circle (which is Jir 2 ). 



The area of the sector AOB is given by — X jrr 2 = \ r 2 d. 

2jz 



^B^H Both these formulas are only true if 0 is in radians. 

™ II 

Try writing down for yourself what the area of the major sector AOB is (that is, the area 
of the rest of the circle). 
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Subtracting the area of the minor sector AOB from the area of the whole circle gives the 
result that the area of the major sector JOS = nr 2 - jr 2 9. 

Alternatively, you could say that the angle of the major sector is 2jt - 9. 
Therefore its area is given by 

\r\2n -9) = nr 2 - \r 2 9. 

4.D.(c) Finding the area of a segment of a circle 

We can find the area of the segment drawn in Figure 4.D.4 by noticing that it comes from 
subtracting AAOB from sector AOB. (I'm using A to stand for 'triangle'.) 




Figure 4.D.4 



Again, the angle 9 is in radians. 

We know from Section 4.B.(b) that the area of AAOB is equal to jf 2 sin 9, so the area 
of the segment shown (that is, the minor segment), is given by the rule below. 



The area of the segment AOB = \r 2 9 - \r 2 sin 9 = \r 2 (9 - sin 9) 



(Make sure that your calculator is in radian mode when you find this!) 
Now try writing down for yourself the area of the major segment AB (that is, the 
unshaded part of the circle in Figure 4.D.4). 



It is given by Ttr 2 - \r 2 (9 - sin 9) = \r 2 (In - 9 + sin 9). 

4.D.(d) What do we do if the angle is given in degrees? 

I will call the angle D° to avoid confusing it with the angle 9 in radians. 
There are two things you can do in this situation. 

Method (i) Immediately convert the angle D° into radians by multiplying it by 

jt/180. (See Section 4.D.(a) if necessary.) Then you can use all the rules 
given above for angles in radians. This is the method I would 
recommend. 
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Method (2) Alternatively, you can change the rules that we have already found so 

that they will be right for working with angles in degrees by replacing 6 
by Iter/180. 

This will then give you, for an angle D measured in degrees, 

Dji jirD angle 

(1) The arc length is r X = = X circumference. 

180 180 360 

(2) The area of the sector is 

1 Djt Jtr 2 D angle 

— r 2 X = = X the area of the circle. 

2 180 360 360 

These rules are more clumsy than the rules for radians because of 
the arbitrary nature of the choice of 360 for the number of degrees 
in a full turn. 

Because radians use the structure of the circle itself, they give 
much nicer results. 



exercise 4.P.2 Now try these questions, giving your answers correct to 2 d.p. (if they are not 

exact) in the units used on the drawings. 

(1) Using the sketch shown in Figure 4.D.5(a), find 

(a) the minor arc length AB, 

(b) the area of A AOB, 

(c) the area of the minor segment AB. 

(2) Find the shaded area (that is, the major sector) shown in Figure 4.0.5(b). 




Figure 4.D.5 



The circle shown in Figure 4.D.5(c) above has a fixed radius of r units. What 
do you think the size of the angle 6 should be in order to make triangle AOB 
have maximum area? 



4.D.(e) Very small angles in radians - why we like them 

Radians have a second very special quality, as well as being independent of anyone's 
particular choice of number. 

Suppose we start with an angle of 0 radians as shown in Figure 4.D.6. 
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Figure 4.D.6 



We know from Section 4.D.(a) that the arc length is rO, and we also know that 

y x y 

sin 9 = —, cos 6 = — and tan 6 = — . 
r r x 

What happens to these trig ratios as 0 becomes very small? 

Try finding this out yourself experimentally with your calculator. Use radian mode, 
and put in very small values for the angle, say 0.001 as one possible value. See what 
values the answers are close to. Can you see why this might be if you look at the drawing 
of Figure 4.D.7? 




< X > 

Figure 4.D.7 



Look also to see if there seems to be any connection between the size of the angle that 
you put in and the values for sin, cos and tan that you get out. 

^ I 

^■^^ Remember that your calculator must be in radian mode for this experiment. 

^H-^V A mistake here will seriously affect your results. (For example, 1° is quite a 
small angle, but 1 radian is about 60°, so an input of 1 will give you vastly 
different results depending on which mode your calculator is in.) 



You should now have a good experimental idea of what is happening. 

We will now look together at why this should be so. 

Figure 4.D.7. shows a very small angle 0 set inside its circle. 
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As 9 becomes increasingly smaller, x becomes closer and closer to r so cos 0 —> 1 . 

(The — > symbol I have used above is a mathematical shorthand for saying 'becomes 
increasingly closer in value to'. It saves a lot of writing!) 

Also, y becomes very small indeed, so sin # — > 0, and tan 9 — > 0 also. 

But you should also have found a more startling result. Not only are sin 9 and tan 9 
becoming very small, they are also becoming very close to 9 itself, as 9 becomes small. 

We can see from the diagram that this must be so. 

As y becomes smaller it gets closer and closer in length to the arc r9. So 



r9 

sin 9 — > — , that is sin 9 -> 9 as 9 — > 0. 

r 



The smaller the angle becomes, the closer these two are. We also see that sin 9 will 
always be slightly less than 9 because y stays less than r9. Notice that the arc r9 will become 
closer and closer to a straight line as 9 becomes smaller. 

Now, what happens to tan 91 

Since tan 9 = ylx, it is clearly going to get smaller and smaller just as sin 9 does. It looks 
from the calculator as if it is close to 9 too, but a little bit larger. 
Will it stay like this? 

We can see that it will from Figure 4.D.8. 




Figure 4.D.8 



This uses the fourth property from Section 4.C.(b) to give the right angle between the 
radius and the tangent. Using this right-angled triangle, tan 9 = dlr, but d is getting closer 
and closer to r9 while remaining just slightly larger. 

So 



r9 

tan 9 -> — , that is tan 9 -> 9 also, as 9 — > 0. 
r 



But it stays slightly larger than 9 while sin 9 stays slightly smaller. 
The fact that when we measure in radians sin 9 and tan 9 are approximately the same as 
9 when 9 is very small is of crucial importance when we come to calculus. 
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4.E Tidying up - some thinking points returned to 

4.E.(a) The sum of interior and exterior angles of polygons 

At the end of Section 4.A.(e) on congruent triangles, I asked you if you could find the sum 
of the interior angles of a six-sided figure. (This is called a hexagon.) 




Figure 4.E.1 



(a) One way of answering this question is to split the shape into triangles by joining 
up to one corner as I have shown in Figure 4.E.l(a). 

This gives us four triangles, that is, two fewer triangles than there are sides. 
Together they account for all the interior angles. 

We see, therefore, that the sum of the interior angles is 4 X 180° = 720°. 

(b) You could also have got this answer by joining up each corner (or vertex) to some 
point inside the hexagon, as I have shown in Figure 4.E.l(b). This would then give 
you six triangles, so 6 lots of 180°. You then take off the 360° for the full turn in 
the middle, so finishing up with the same answer as (a). 

You can then use either of these methods to answer my third question. 
Using (a), we can say that, if the polygon has n sides, splitting it up in the same 
way will give n - 2 triangles. 

Therefore the sum of the interior angles would be (n - 2) X 180°. 
This result is usually written in the following form. 



The sum of the interior angles of an «-sided polygon is equal to (2n - 4) 
right angles. 



The sum of the exterior angles will be the same whatever the shape of the hexagon is, 
so long as we are turning inwards all the while as we go round. 

We find this sum by noticing in Figure 4.E.2(a) that we have six straight lines formed by 
the exterior angles and the interior angles together. 

Therefore, the exterior angles together make 6 X 180° - 720° = 360° or a full turn. 

We can see that this must be so because if we start at A and travel round the sides of the 
shape, we will have made a full turn when we come back to A. This full turn is built up from 
all the small turns made by the exterior angles, as I have shown in Figure 4.E.2(b). Exactly 
the same thing will happen however many sides the shape has, provided we are always 
turning inwards as we go round, that is, none of the interior angles is greater than 180°. The 
exterior angles will always add to four right angles. 
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Figure 4.E.2 



Indeed, this result is still true if our particular choice of shape means that we do sometimes 
turn outwards, but in this case we must count these outwards turns as negative. 

4.E.(b) Can we draw circles round all triangles and quadrilaterals? 

I asked you this question at the end of Section 4.C.(c) on the special properties of circles. 
The answer is that it is always possible to draw a circle round a triangle. 
You can see this from the drawings of Figure 4. E. 3(a) and (b). 




Figure 4.E.3 



From (3) in Section 4.C.(b), the centre of the circle would have to lie on the line PQ. (The 
little marks are to show that PQ divides BC equally in two as well as being perpendicular to it.) 
For the same reason, it would have to lie on RS. 

But where PQ and RS cross, we have CO = BO and BO = AO. So CO = AO too, and O 
is the centre of the circle which triangle ABC sits inside. 

We can also see from this that it isn't always possible to draw a circle round a 
quadrilateral like ABCD. 

If we have a quadrilateral ABCD sitting inside a circle, as in Figure 4.E.4, then this must 
be the particular circle which can be drawn round triangle ABC. 

But a small adjustment to D, either inwards or outwards, will mean that this point is no 
longer on the circle which works for A, B and C. 

So what particular property must AB CD have for it to be possible to draw a circle through 
its four corners? 
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Figure 4.E.4 



We can see the answer to this from Figure 4.E.5(a). 

Using (1) from Section 4.C.(c), we know that AAOC = 2AABC. 

Looked at the other way up, the other part of AAOC = 2AADC. 

But the two parts together of ^AOC make 360°, so ^ABC + /LADC = 180°. 

Also, since LA + /LB + L.C + L.D = 360°, LA + = 180° too. 




Figure 4.E.5 



It is only possible to draw a circle through the four corners of a quadrilateral if its 
opposite angles add up to 180°. Such a quadrilateral is called cyclic. 

This is the same as saying that each exterior angle must equal its interior opposite angle. 
We can see that this must be so from Figure 4. E. 5(b) since the two angles at^ together make 
a straight line. 
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5 Extending trigonometry to angles 
of any size 

This chapter makes it possible for us to use trig ratios with angles of any size, and 
looks at the graphs of these trig functions. These are very important in many 
physical applications, so we look at what happens if we shift them and combine 
them. We also look at methods of handling trig functions and equations. 
The chapter is divided into the following sections. 

5.A Giving meaning to trig functions of any size of angle 

(a) Extending sin and cos, (b) The graph of y = tan x from o° to 90 0 , 

(c) Defining the sin, cos and tan of angles of any size, 

(d) How does X move as P moves round its circle? 

(e) The graph of tan 0 for any value of 0, (f) Can we find the angle from its sine? 

(g) sin" 1 x and cos" 1 x: what are they? 

(h) What do the graphs of sin _1 x and cos _1 x look like? (i) Defining the function tan" 1 x 

5.B The trig reciprocal functions 

(a) What are trig reciprocal functions? 

(b) The trig reciprocal identities: tan 2 8 + 1 = sec 2 6 and cot 2 6 + 1 = cosec 2 6, 

(c) Some examples of proving other trig identities, 

(d) What do the graphs of the trig reciprocal functions look like? 

(e) Drawing other reciprocal graphs 

5.C Building more trig functions from the simplest ones 

(a) Stretching, shifting and shrinking trig functions, 

(b) Relating trig functions to how P moves round its circle and SHM, 

(c) New shapes from putting together trig functions, 

(d) Putting together trig functions with different periods 

5.D Finding rules for combining trig functions 

(a) How else can we write sin (A + B)l 

(b) A summary of results for similar combinations, 

(c) Finding tan 04 + 6) and tan 04-6), (d) The rules for sin 2A, cos 2A and tan iA, 

(e) How could we find a formula for sin 3/4? 

(f) Using sin (/\ + 6) to find another way of writing 4 sin f + 3 cos f, 

(g) More examples of the R sin (f ± a) and R cos (f ± a) forms, 

(h) Going back the other way - the Factor Formulas 

5.E Solving trig equations 

(a) Laying some useful foundations, (b) Finding solutions for equations in cos x, 
(c) Finding solutions for equations in tan x, (d) Finding solutions for equations in sin x, 
(e) Solving equations using R sin (x + a) etc. 



5.A Giving meaning to trig functions of any size of angle 

5.A.(a) Extending sin and cos 

In the last chapter we discovered that we were able to find the sin and cos of some angles 
between 90° and 1 80° by using the Sine and Cosine Rules for any triangle. (In fact, it would be 
possible, by choosing suitable triangles, to find the sin and cos of any angle in this range.) 



5.A Trig functions of any size of angle 



175 



It seemed, from the results which we got there, that we would need to put sin ( 1 80° - x) 
= sin x and cos (180° - x) = - cos x in order to make the Sine and Cosine Rules work for 
all triangles. If we use this to draw graphs of y = sin x and y = cos x for values of x from 
0° to 180° we will get curves like those in Figure 5.A.l.( a ) an d (b). 



= Sift x 





The shape of these two curves suggests that what we have here is part of a much longer 
pattern, and that indeed they are parts of the same graph which has just been shifted by 90° 
to the left to give the second case. 

This view will seem very reasonable if you have seen, for example, sound waves displayed 
on an oscilloscope, or the graph of an alternating electric current in a wire, or the waves which 
you get along a rope if you fix one end and move the other end up and down. 

From these physical examples, we will get the pair of graphs shown in Figure 5.A.2.(a) 
and (b). I have used units of radians here for the angles. I explain how radians work in 
Section 4.D and if you are at all unsure about them you should go back there now, before 
going on. This is because they are very important throughout this chapter and for future 
work, particularly if it involves calculus. 




Figure 5.A.2 
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Clearly, there is no particular reason to stop anywhere, so we imagine the two graphs as 
extending an infinite distance in both the + and - directions. 

How many special distinctive properties can you see in these two graphs? 
Make a note of as many as you can. 



Here are some of the important particular properties of these two graphs which I hope 
that you will have noticed. 



(1) The cos graph is symmetrical about the v-axis, or the line x = 0. 

For example, cos f = cos(-f ). In fact, cos x = cos(-x), whatever x is. 
A graph like this is called even, as we saw in Section 3.B.(j). 

(2) The sin graph exactly fits onto itself if it is rotated through half a complete turn 
about the origin. If you turn the page upside down, this graph is unchanged. 

You could also describe this by saying that the graph of sin x reverses sign if it 
is reflected through the j-axis. 

sin f = - sin(-j), and sin x = - sin(-x) whatever x is. 

A graph like this is called odd. (Again, there were similar ones in Section 
3.B.G).) 

(3) They are the same graph, except that the sin graph must be shifted Jt/2 to the left 
to give the cos graph. 

For example, sin § = cos 0, sin n = cos § and, in general, sin (x + § ) = cos x. 
(There are other examples of shifts in Section 3.B.(d).) 

(4) Both of the graphs infinitely repeat themselves, with the length of the unit of 
repeat being 2jt in each case. This is called the period of the graph. 

(5) In both cases, the graphs are enclosed in a pair of horizontal lines which are one 
unit either side of the x-axis so the maximum displacement of the graph from this 
axis is one unit. 



exercise 5-A.i We have already found (in Section 4.A.(g)), values for the sin and cos of angles of 

o°, 30°, 45°, 6o° and 90 0 . 

I have shown these values again set out in the table below, using both radians 
and degrees. 



Angle (x) 
degrees 


-180 


-120 


-90 


-30 


0 


30 


45 


60 


90 


120 


180 


210 


270 


315 


360 


radians 


-71 


~T 


— 2 


it 

~6 


0 


It 
6 


71 


It 

3 


it 
2 


nt 

T 




6 


3jt 
2 


lit 

T 
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sin x 










0 


1 
2 


1 

\ ; 2 


V'3 
2 


1 














cos X 










1 


v'3 
2 


1 

\ ; 2 


1 
2 


0 















Use these values, and the symmetrical properties of the graphs shown in 
Figure 5.A.3 (a) and (b), to write down the values of the sin and cos of the other 
angles listed in the table. Check your values using your calculator. 



5.A Trig functions of any size of angle 



177 




Figure 5.A.3 



5.A.(b) The graph of y = tan x from o° to 90 0 

We have not yet thought about what the graph of y = tan x will look like. We know from 
Section 4.A.(g) that 

1 

tan 45° = 1, tan 30° = — = 0.58 to 2 d.p. and 

V3 

tan 60° = y[3 = 1.73 to 2 d.p. 
We also know, from Section 4.A.(h), that 

sin x 0 1 

tan x = so tan 0° = — = 0 and tan 90° = — = trouble, 

cos x 1 0 

since we can't divide by zero. 

Using your calculator, you can see that, the closer the angle gets to 90°, the larger its tan 
becomes. (Try this for yourself.) You can also see that this will happen from the three 
triangles in Figure 5.A.4(a) by finding the tans of the three marked angles. The height of the 
triangles remains the same but the horizontal measurement becomes smaller, so the fraction 
which gives the tan is becoming larger. 

Using all our known information, we get a sketch for y = tan x from 0° to 90° which looks 
like Figure 5.A.4(b). 
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(degrees) 

Figure 5.A.4 

5.A.(c) Defining the sin, cos and tan of angles of any size 

There is no general Tangent Rule which works for any triangle, like the Sine and Cosine 
Rules, so we have no simple way to sketch the continuation of the graph for tan x. 

It would be good to have a definition for the sin, cos and tan of angles of any size so that 
we wouldn't have to rely on what is apparently happening physically, although, to be useful, 
any definition would have to fit in with observed wave phenomena. 

We shall now do this by using the turn or angle measured out on a circle. (We have already 
used this method for showing the turn of angles in Figure 4.C.1 in the last chapter.) 

We consider the rotation of a unit length through a full turn about the origin, in an 
anticlockwise direction from the positive x-axis. I have shown this in four separate diagrams 
which show rotations round to each quadrant or quarter-circle, in turn. The angles of rotation 
are shown shaded. 

You can think of OP as a rod of length one unit which is turning about O. 
First quadrant 

In the first quadrant, shown in Figure 5. A. 5, the definition exactly tallies with the definitions 
given at the beginning of the last chapter in Section 4.A.(a) for the sin, cos and tan of angles 
between 0° and 90°. I have used the symbol 0 for the angle here, as I want to keep x for the 
length OX. (0 is the Greek letter theta.) 




Figure 5.A.5 
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We use the right-angled triangle OPX, and say 

PX PX 

sin 9 = = = v 

OP 1 

OX OX 

cos 9 = = = x. 

OP 1 

Both sin 9 and cos 9 are positive since they are measured along the positive x andj axes. 
PX y 

tan 9 = = — so tan 9 is positive, also. 

OX x 



It is very important that this new definition is giving sin 9 and cos 9 as 
measurements along the y- and x-axes respectively - so important that I 
suggest that you use one colour for y = sin 9 and another for x = cos 9 here, 
and on the following three diagrams. 



Second quadrant 

The angle we are considering is now between jt/2 and n radians (or 90° and 180°.) Again, 
we use the right-angled triangle OPX for our definitions. 




Figure 5.A.6 



We say 

sin 9 - 

cos 9 



PX PX 

OP 1 

OX ox 

OP 1 



x. 



This time, although y is positive, x will now be negative since it is measured along the 
negative x-axis, so sin 9 is positive but cos 9 is negative. This agrees with what we found 
when we used the Sine and Cosine Rules for angles larger than 90°. 

PX y 

tan 9 = = — so it is also negative. 

OX x 

We can see from the diagram that sin(7T - 9) = sin 9 and that cos(?r - 6) = - cos 9. 
(jt - 9) = APOX in size, so it would come in the first quadrant. 
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Third quadrant 

Again using the right-angled triangle OPX for our definitions, we say 



PX PX 

sin 9 = = = v, 

OP 1 

OX OX 

cos 9 = = = x. 

OP 1 



This time, both sin 9 and cos 9 are negative, since they are measured along the negative y 
and x axes respectively. 

PX y 

tan 9 = = — so it is positive. 

OX x 

We also see from the diagram that sin 9 = -sin(# - jc) and cos 9 = -cos(# - ji). 
(6 - 71) = APOX in size, so it would come in the first quadrant. 

Fourth quadrant 

Again using the right-angled triangle OPX for our definitions, we have 

PX PX 

sin 9 = = = v, 

OP 1 

OX OX 

cos 9 = = = x. 

OP 1 




-t 



Figure 5.A.7 




l -3 



-I 



—i 



Figure 5.A.8 



5.A Trig functions of any size of angle 



181 



We see that sin 9 is negative, and cos 9 is positive, from the positions of y and x on the two 
axes. 

PX y 

tan 9 = = — so it is negative. 

OX x 

We also see that sin 9 = - sin(2jr - 9) and cos 9 = cos(2jt - 9). 
{In - 9) = /LPOX in size, so it would come in the first quadrant. 

You can see from these four diagrams that, by using the right-angled triangle OPX'm each 
quadrant, we have now defined the sin and cos of the angle 9 in terms of the shadow or 
projection of the unit length OP on the x-axis for cos 9 (the distance shown as x in the 
diagrams), and the shadow or projection of OP on the _y-axis for sin 9 (the distance shown 
as y in the diagrams). If you have highlighted x and y with two different colours on these 
diagrams, it will emphasise for you, when you look back at them, where the sin and cos are 
and how they are changing. 

The + or - signs automatically follow from where the projections lie on the two axes. You 
may find it helpful to use the picture shown in Figure 5.A.9 to remember the changing signs 
for sin, cos and tan in a complete turn. 




Figure 5.A.9 



The letters A S T C stand for whatever is positive in that particular quadrant. A = 'all', 
S = 'sin', T = 'tan' and C = 'cos'. This can be remembered by a catch-phrase if you like, 
such as 'All Silly Tom Cats.' 

When OP has turned through an angle of In it will have returned to its original position. 
(It has completed one cycle.) If we then continue to rotate it, the whole identical process will 
be repeated with each new full turn or cycle. 

We can obtain negative angles by rotating OP in the opposite direction, so we would 
rotate it clockwise from the positive x-axis to get these angles. 

Plotting the graphs for y = sin 9 and y = cos 9, using the definitions which we have just 
given, will give us identical graphs to the ones in Figure 5.A.2 which we know describe 
actual physical happenings. 



5.A.(d) How does X move as P moves round its circle? 



Suppose the point P is moving round the circle shown in Figure 5.A.10 at a 
steady speed, starting from the points. Suppose that the radius of the circle 
is 1 m (metre), and that, after one second, P has moved a distance of 1 m. 
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Figure 5.A.10 



Try answering the following questions. 

(1) What angle (in radians) has the line OP turned through after one second? 
(See Section 4.D.(a) if you need help with radians.) 

(2) How long will it take P to make a full turn round the circle? 

(3) How far is the point X from O after a time of 

(a) 0 seconds, (b) 1 second, (c) 1.5 seconds, (d) nil seconds, (e) n seconds, 
(f) 3tt/2 seconds, and (g) t seconds? 

(4) As P turns round the circle at its steady speed, how is the point X moving? Does 
it also have a constant speed? If not, when do you think it is moving fastest? When 
is it moving slowest? 



These are the answers which I hope you have found. 

(1) One radian. We say that the angular velocity of P is one radian per second. 

(2) A full turn is 2ji radians, so 2jt seconds. 

(3) (a) OX= 1 m. (b) OX= cos t = cos 1 = 0.54 m to 2 d.p. 

(c) After 1.5 seconds, OX= cos 1.5 = 0.07 m to 2 d.p. 

(d) After jt/2 seconds, X is at O, so OX = 0. 

(e) After Jt seconds, the distance OX is again 1 m as P is now at B. We can think of 
this distance as negative, since it is measured in the opposite direction to OA. 

(f) After 3tt/2 seconds, OX = 0. 

(g) After t seconds, OX = cos t metres. If we let OX = x, we could write the 
equation giving the position of X after time t as x = cos t. 

(4) X is not moving at a constant speed. It moves fastest as it passes through O and 
slowest at the points A and B when it instantaneously comes to rest before turning 
back on itself. 

The point X is moving in what is called simple harmonic motion or SHM. 

Surely, if we know the distance or displacement of X from O at any time, we 
have enough information to discover its speed exactly? Indeed we have, and we 
shall be able to do just this in Section 8.A.(e). 

5.A.(e) The graph of tan 6 for any value of 6 

Using tan 9 = ylx = sin 0/cos 9 in the four diagrams of Figures 5.A.5-5.A.8, we can now 
define tan 9 for any size of angle 0. We can therefore draw the extended graph of y = tan 9 
which I've done in Figure 5.A.11. 
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Figure 5.A.11 



What special properties does this graph have? Make a note of as many as you can. 



The graph shows these special properties. 

• It is periodic, but the period of repeat this time is it rather than lit, as it was for sin 6 
and cos 0. 

• It is odd, that is, if you rotate it through half a turn about the origin, it fits exactly onto 
itself, so if you turn the page upside down you get the same graph. Equally you could 
say that, if you reflect it through the y-axis, it reverses its sign, so 

tan x = - tan(-x). 

• The tan of an angle just less than jt/2 (or 90°) is very large and positive. 

• The tan of an angle just greater than Jt/2 (or 90°) is very large and negative. 

• There is a jump or discontinuity in the graph when 6 = jt/2 and we therefore see that 
the tan of 90° can 't be given a value, and any calculator asked to display it will give 
an ERROR message. The same thing happens for all odd multiples of nil, so on the 
graph we see it happening at 

it it jt Jt 

-IX—, + 1 X — and + 3 X — and + 5 X — . 
2 2 2 2 

The graph has a vertical asymptote for each of these values of 9, just as the graph in 
Section 3.B.(i) had a vertical asymptote of x = 2. 

5.A.(f) Can we find the angle from its sine? 

In Figure 5.A.12, I show again the graph of y = sin x for values of x from -Jt to 2jt. 

From this graph, find x for these values of sin x. 

(a) sin x = 1 (b) sin x = 0 (c) sin x = -1 (d) sin x = \ (e) sin x = - \. 
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Figure 5.A.12 



Here are the answers which you should have found. 

(a) x = nil. 

(b) As soon as we try this one, we find that we've got a more complicated situation. 
There are four possible values of x on this graph for which sin x = 0. 

We can have x = —jt or x = 0 or x = jt or x = 2jt. 

(c) Similarly, if sin x = -1, from the graph we have x = -?t/2 or + 3?r/2. 

(d) If sin x = \, then from the graph we have x = tt/6 or 5jt/6. 

(e) If sin x = - 5, then from the graph we have x = -5nl6 or -jt/6 or 7tt/6 or 1 ljt/6. 

We can see that extending the graph further in either direction would give us more 
solutions for x for any given value of sin x, and that there are, in fact, an infinite number of 
possible solutions. 

Although this infinitely repeating possibility will be very important in describing some 
situations, such as those involving waves of one kind or another, in many other 
circumstances they will just be an awkward embarrassment. If you have sin x = 0.6, for 
example, and you want to find an angle from this on your calculator, you don't really want 
it to try to flash up an infinite number of answers for you. 

So what do we do? 

It would make sense for us to restrict the possible angle shown for a given sin to a short 
range so that we only get one answer, but every possible value for sin x is included that is, 
we have all values of sin x from -1 to +1 . If we do require further answers, we can then find 
them using the repeating pattern of the graph. (We shall look into this in more detail later 
on in Section 5.E.(d).) 

We shall want to include 0° to 90° (or 0 to jt/2 radians) in our range because this is the 
cradle of civilisation as far as trig is concerned - it all started with right-angled triangles. But 
this will only give us answers for positive values of sin x, so what should we add to it? 



We see from the graph that if we add -90° to 0° (or -jt/2 to 0 radians) we shall be all 
fixed up. 

Then if, for example, sin x = -0.4, using INV or SHIFT or 2nd Function Sin on your 
calculator (in degree mode) should give you an angle lying between -90° and 0°. Try it 
and see. 

You should get -23.6° to 1 d.p. 
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It would have been no good trying to extend the range by adding on 90° to 1 80° because 
this would have just given us repeats for the positive values of sin x and no solutions for the 
negative values. 

Exactly the same sort of problem with multiple solutions will happen if we want to find 
an angle from its cos. 

Look back to the graph of y = cos x in Figure 5.A.3(b) and decide for yourself what you 
think a sensible range for the answers would be. 

What do you think you should have for x if cos x = 5? 

What should you have for x if cos x = -5? 

Test out your ideas by seeing if your calculator agrees with you. 



You should have the range from 0° to 180° this time (or 0 to n radians). This then gives 
you one and only one possible solution for any value of cos x between -1 and +1, and 
includes those important angles between 0° and 90°. 

Using this range gives x = 60°, or jt/3 radians, if cos x = 5, and x = 120°, or 2tt/3 radians, 
if cos x = -5. 

What we have cunningly done here, by restricting the range of values which we will allow 
for the angle from a given sin or cos, is to give ourselves inverse functions to take us back 
from a known sin or cos to just the one possible angle. (If you need help with inverse 
functions, you should go back now to Section 3.B.(g).) 

We have already dealt with a similar situation to the one which we have here when we 
were looking for an inverse relation forJ{x) = x 2 in Section 3.B.(j). There we also found that 
we could define an inverse function by restricting the possible values for x. 



Don't panic! We have just found them. 

sin -1 x is the inverse function which takes us back from a value of sin x to an angle with 
that sin, and cos -1 x is the function which takes us back from a value of cos x to an angle 
with that cos. The possible values of these angles are restricted in the way we have just 
decided above will make sense. 

With these restrictions, there is only one possible value for the angle from a given sin or 
cos, which is a condition which we must have for a relation to be a function as we saw in 
Section 3.B.(c). 



Two inverse trig functions 

sin -1 x is the angle in the range from -90° to +90° 
(or - nil to +jt/2 radians) whose sin is x. 

cos -1 x is the angle in the range from 0° to 1 80° 
(or 0 to Jt radians) whose cos is x. 

shr 1 x is sometimes called arcsin x and cos -1 x is sometimes called arccos x. 



5.A.(g) 



sin 1 x and cos 1 x: what are they? 




sin 1 x does not mean 1/sin x. This would be written as (sin x) .It is one of 
those tricky bits of mathematical notation which make a trap for the unwary 
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5.A.(h) What do the graphs of sin" 1 x and cos" 1 x look like? 

We can use the method which we found in Section 3.B.(g) to draw a sketch of these two 
graphs. 

Since the inverse relations take us from the y values back to the original x values, their 
graphs are mirror images of the original graph in the line y = x. The sketches will be easier 
to draw if we take equal scales on the two axes. We then get graphs as sketched in Figure 
5.A.13 and Figure 5.A.14. If you are sketching these graphs for yourself, you may find it 
helps if you use the helpful hint I suggested for complicated inverse sketches in Section 
3.B.(i). If you use equal scales on your two axes, and turn your paper so that the line y = x 
is vertical, it is much easier to sketch the mirror image of f(x) in the line y = x which gives 
you the graph of f~ l (x). 

You can see from Figure 5.A. 13(a) that, without the restrictions, the inverse relation is not 
a function - extending the graph would give an infinite number of solutions to 'y is the angle 
whose sin is x.' (Remember the raindrop test in Section 3.B.(c).) 




You can also see how we have forced a function from this relation by restricting the range 
of values which we will accept. 

This is shown in the graph in Figure 5 .A. 13(b) which represents the function 'y is the 
angle in the range from -Till to +jiI2 radians whose sin is x.' Notice that this function is only 
defined for values of x lying between -1 and +1 inclusive, that is, -1 < x < +1, because this 
is the range of possible values for sin x. 

Similarly, the graph in Figure 5.A. 14(a) shows the repeated solutions of 'y is the angle 
whose cos is x\ while Figure 5.A. 14(b) shows the function 'y is the angle in the range from 
0 to 71 radians whose cos is x\ which gives a single solution for y for each x. 

Again, -1 < x < + 1. 




Figure 5.A.14 



I think it will help you a lot here if you put your own two colours on each of the pairs 
of graphs of y = sin x and y is the angle whose sin is x, and y = cos x and y is the angle whose 
cos is x. It's much easier then to see which wiggle belongs to which. 
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5.A.(i) Defining the function tan" 1 x 

To do this, we need to look at the graph of y = tan x which I show in Figure 5.A.15. 

We see from this graph that, for any given value of tan x, there will be an infinite possible 
number of angles x which have this tan value. For example, if tan x = 1 then, from the graph, 
we could have x = ji/4 or 5jt/4 or 9tt/4. Clearly, there are infinitely many more answers 
stretching out in both the right-hand and left-hand directions. 




Figure 5.A.15 



To define the function tan -1 x, we shall again have to restrict the possible range of angles 
which we will allow. We certainly want to include 0 to ji/2 and we could extend the range 
so as to go either from -jt/2 to +jt/2, or from 0 to jt in order to get just one possible solution 
for the angle from each possible value of tan x. 

The agreed convention is that we take the range from -jt/2 to +jt/2. 

I show a sketch of the graph of y = tan -1 x below, in Figure 5.A.16. 

I've drawn it by using the reflection in the line y = x of the graph of y = tan x for values 
between - jt/2 and jt/2. 

Again, using two colours, one for each of tan x and tan -1 x, will make the two graphs 
stand out more clearly for you. 



J= tan x 



J= tan" 'x 




Figure 5.A.16 
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5.B 



5.B.(a) 



The trig reciprocal functions 



What are trig reciprocal functions? 



The reciprocal function of a function, /(x), is defined as 
The three trig reciprocal functions are 



fix) 



1 1 

; = (sin x) 1 = cosec x, = (cos x) 1 = sec x, 



sin x 
1 

tan x 



cos x 



= (tan x) 1 = cot x. 



Remember that these are not the same as the inverse functions, 
sin -1 x, cos -1 x and tan -1 x. 



5.B.(b) The trig reciprocal identities: tan 2 8 + l = sec 2 6 and cot 2 0 + l = cosec 2 0 

In Section 4.A.(h), we used Pythagoras' Theorem to show that the three identities, 



sin 2 9 + cos 2 0=1, 
tan 2 6 + 1 = sec 2 0, 
cot 2 0 + 1 = cosec 2 0, 



are true for any angle 0 which is less than 90°. 

These three identities will remain true for any angle 0 since, as we have seen in Section 
5.A.(c), we still have the right-angled triangles. Although negative values for the sin, cos and 
tan of 6 are now possible, when they are squared they become positive, and therefore the 
three identities remain true. 



5.B.(c) Some examples of proving other trig identities 

Students quite often find this process difficult, so we shall now look at some examples of 
how it is done. 



1 

example (i) Prove that tan 0 + cot 6 = for any angle 6. 

sin 0 cos 0 

^^^^ We have to show that the two sides are equal, so we mustn't write them down 
^H-^V as equal from the start. 
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Instead, we deal with the sides separately. Here, 

sin 0 cos 0 sin 0 sin 0 cos 0 cos 0 

LHS = tan 0 + cot 0 = + = X + X 

cos 0 sin 0 cos 0 sin 0 sin 0 cos 0 

sin 2 0 cos 2 0 sin 2 0 + cos 2 0 1 

= + = = = RHS. 

sin 0 cos 0 sin 9 cos 0 sin 0 cos 0 sin 0 cos 0 

Just like adding any other fractions, we make it possible to put them 
over the same denominator in the first line of working above - see 
Section l.C.(c) if necessary. 

example (2) Try showing that sec 2 0 + cosec 2 0 = sec 2 0 cosec 2 0 for yourself. 
It looks quite an unexpected result! 



You could do it like this: 

1 1 sin 2 0 cos 2 0 

LHS = sec 2 0 + cosec 2 0 = + = + 

cos 2 0 sin 2 0 cos 2 0 sin 2 0 sin 2 0 cos 2 0 

sin 2 0 + cos 2 0 1 

= = = sec 2 0 cosec 2 0 = RHS. 

cos 2 0 sin 2 0 cos 2 0 sin 2 0 

I say above 'you could do it like this' because identities can usually be proved in a large number 
of different ways. This is because the process is a bit like following a maze; you can write down 
a sequence of true statements starting from one side, but they do not always bring you any 
closer to the other side. Sometimes, after much effort, they bring you back where you started - 
at least you know then that what you have written down is true if not helpful. 

Usually it is best to start with the more complicated side and show that this can be 
reduced to the simpler side. In really tough cases, it pays to work on each side separately and 
bring both of them to some third form. (The example which we have just done can be proved 
very neatly by using the two relevant identities of Section 5.B.(b) on each side in turn. Try 
it and see!) 

Because there are all these possible branches to follow, you should never spend too long 
trying to prove an identity in an exam. If it doesn't come out quite quickly, leave it and return 
to it later if you've got time. 

Have a go at the one below too. It is a bit tricky, but you have all the working knowledge 
and skills to get through it all right. We'll take it in stages. 

cos x sin x 

example (3) Show that + = sin x + cos x for any angle, x. 

1 - tan x 1 - cot x 

The LHS is more complicated, so we will work with this and try to 
show that it is the same as the RHS. It would seem to be a good idea to 
have the whole of this side in terms of sin x and cos x. How can we 
rewrite tan x and cos x to do this? 
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We can put 

sin x cos x 

tan x = and cot x = 

cos x sin x 

then, at least, everything is in terms of sin x and cos x. Then 

cos x sin x 
LHS = — + 



j sin x j cos -t 
cos x sin * 



Now what should we do? (See if you can tidy up what we've now got.) 



We get rid of fractions inside fractions by multiplying the first bit 
top and bottom by cos x, and the second bit top and bottom by sin x. 
(Try doing this if you didn't already.) 



You should get 



cos 2 x sin 2 x 



LHS = + 

cos x - sin x sin x - cos x 

Using sin x - cos x = -(cos x - sin x), how can we rewrite what we've 
now got? 



We can say that 
LHS = 



cos 2 x sin 2 x cos 2 x - sin 2 x 



cos x - sin x cos x - sin x cos x - sin x 
How can we rewrite the top? (Try using a neat factorisation.) 



cos 2 x - sin 2 x = (cos x - sin x) (cos x + sin x) 
(using the difference of two squares) 

Try to finish it off now. 



(cos x - sin x) (cos x + sin x) 

LHS = = cos x + sin x = RHS. 

cos x - sin x 

#You may have recognised that cos 2 x - sin 2 x could also be written as cos 2x. 
Although this is true, it would not have helped us here. The trickiest part in 
proving identities is picking out the possible steps which will also lead you 
forward in the proof. 
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5.B.(d) What do the graphs of the trig reciprocal functions look like? 

We start by thinking about how we can draw a sketch of the graph of 

1 

y = cosec x = . 

sin x 

I show in Figure 5.B.1 a sketch of y = sin x to work from. 




To help us, we need first to answer the following five questions. 

(1) When sin x = 1, what is cosec x? 

(2) When sin x = -1, what is cosec x? 

(3) Does cosec x have the same sign as sin x? 

(4) What happens to cosec x when sin x is positive but very close to zero? 

(5) What happens to cosec x when sin x is negative but very close to zero? 

Try answering each of these for yourself. 



Here are the answers. 



(1) cosec x = 1 (2) cosec x = -1 (3) Yes it does, since it is just 1/sin x. 

(4) cosec x becomes very large and positive. 

(5) cosec x becomes very large and negative. 

(When sin x = 0, cosec x is undefined because we can't divide by zero.) 
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exercise 5.B.1 Using the answers to the five questions above, try to sketch in for yourself the 

graph of y = cosec x on the sketch I have already drawn for you of y = sin x. Use 
pencil so that you can have second thoughts if necessary! 

(The sketch is shown in the answers at the back of the book, but it is 
important to try to draw it yourself before looking.) 

Because the functions of y = sin x and y = cos x are periodic, so also are the functions 
of y = cosec x and y = sec x. The graph of y = sec x is the same as the one for y = cosec x 
shifted by jt/2 to the left. 

(Strictly speaking, when we say that y = cosec x and y = sec x are functions, we must 
exclude any value of x which would involve dividing by zero, as this is impossible.) 

exercise 5.B.2 Using the same methods as you used for sketching y = cosec x, try sketching for 

yourself the graph of y = cot x (that is, the reciprocal graph of y = tan x), using 
the sketch of y = tan x which I have drawn for you in Figure 5.B.2. 

To do this successfully, you will need the answer to one more question. 

What happens to cot x as tan x becomes very large? 




Figure 5.B.2 



cot x will become closer and closer to zero, so that when tan x is undefined, say for 
x = jt/2, cot x = 0. 

5.B.(e) Drawing other reciprocal graphs 

Drawing and checking the two reciprocal graphs of y = cosec x andj = cot x will have shown 
you many of the basic guidelines to use when drawing reciprocal graphs. 

I will summarise these here for you in a box. Then you will be able to have a go at 
drawing reciprocal graphs for some of the functions which have been mentioned in earlier 
chapters. 
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Rules for drawing reciprocal graphs 

If we have a function y =/(x), its reciprocal function is y = llf{x). 

• If the graph of y =f(x) has symmetries (for example, being odd or even or 
periodic), then the graph of l/f(x) will have the same symmetries. 

• If y =f(x) = 0 for some value of x, then \lf(x) is undefined. There is a jump or 
discontinuity in its graph for this value of x. 

This means that, as f(x) gets close to 0, \lf(x) will become very large in value. 
Equally, if there is a jump or discontinuity in the graph of y =f(x) for some 
value of x, then y = \lf(x) = 0 for that value of x. 

• If you know a few key values for y =f(x), it is easy to calculate the 
corresponding values for_y = l/f(x). These can then be used to help you to get 
the sketch in the right place. 



exercise 5.B.3 Using the rules above, try drawing in the reciprocal functions for the six functions 

shown on my graph sketches. Use any values given on my sketches to write in the 
corresponding values on the reciprocal sketches. 

In case some of these functions are unfamiliar, I have given you a reference back 
to where I have talked about them earlier in this book. 

I suggest that you sketch them first in pencil to allow for second thoughts. When 
you have got them right, it might help you to use two colours on them (one for 
the original graph and one for its reciprocal), to emphasise how they depend upon 
each other. 

1 

(1) Sketch y = using my sketch of y = x 2 - ix + 2 = (x - 1) 2 + 1. 

x 2 - IX + 2 

(My sketch uses Sections 2.D.(b) and (c) on completing the square and graph 
sketching.) 

1 

(2) Sketch y = using my sketch for y = x 2 - 4* + 3 = (x - i)(x - 3). 

x 2 - 4* + 3 

(3) Sketch the graph of y = i/x using my sketch of y = x. 

(4) Sketch the graph of y = 1/x 2 using my sketch of y = x 2 . 

(5) Sketch the graph of y = i/e* using my sketch of y = e*. 
You may find that Section 3.C.(f) helps you here. 

x - 2 x + 3 

(6) Sketch the graph of y = using my sketch of y = . 

x + 3 x - 2 

(We drew this sketch in Section 3.B.O).) See if you can also find the coordinates of 
the point where this graph and its reciprocal graph cross over each other.) 
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Figure 5.B.3 



5.C Building more trig functions from the simplest ones 

5.C.(a) Stretching, shifting and shrinking trig functions 

In Section 3.B.(d), we looked at what happens to functions when we add or multiply them 
in different ways. You should look back at this section if you haven't yet read it, and make 
sure that these ideas are familiar to you. I have summarised the effects of the simplest kinds 
of transformation there. 
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Because trig functions are periodic, particularly interesting possibilities of combination 
arise which have profound physical implications. In particular, they are very useful in 
thinking about mechanically vibrating systems and the behaviour of current and voltage in 
electric circuits. They can also be used to describe the different qualities of particular notes 
played on different musical instruments. 

We have already seen that, because these functions are periodic, and because of their 
symmetries, they are very closely related to each other. For example, the cos curve y = cos x 
is the same as the sin curve y = sin x except that the sin curve has been shifted Till to the 
left (Figure 5.C.1). 




Figure 5.C.1 



Using the second result from the summary at the end of Section 3.B.(d), we see that this 
means that sin(x + jt/2) = cos x. 

Combinations of sin and cos functions are often used to describe how various kinds of 
wave motion change with time. In this case we would need to have the horizontal axis in the 
graphs representing time, and so it is better to use t rather than x for the variable on this axis. 
The vertical axis is then measuring some displacement, so it is often labelled x, with x being 
a function of time, t. 

Because so many of the different kinds of waves which occur in the natural world can be 
represented by various combinations of trig functions, these functions are often called wave 
functions or waveforms. 

Using the results summarised in Section 3.B.(d), we can sketch graphs for functions such 
as x = 3 cos t, or x = cos It. I show the sketches for these in Figure 5.C.2(a) and (b). In each 
case, the graph of x = cos t is shown by a dashed line. 



A. X 



oc- 3 cos t 




x = cos 2t 
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In the graph of x = 3 cos t, each value of cos t has been pulled out three times as far from 
the /-axis. 

In the graph of x = cos 2t, each point of the curve as we move out from t = 0 is being 
reached twice as fast. So, if t = jt/2, cos it/2 = 0 but cos(2 X it/2) = cos it = -1. 
We can now use these two graphs to illustrate some important definitions. 

• The maximum displacement or amplitude, A, is 3 units in (a), and 1 unit in (b). 

• If t is in seconds, the period, T, or time taken for each complete cycle is 2it seconds 
in (a), and it seconds in (b). 

• The frequency,/, which is the number of cycles per second, is \llit in (a), and I /it in 
(b). The units for frequency are hertz, written as Hz. 



1 

T and / are related by the equation T = — . 



exercise 5-c.i Using the results from Section 3.B.(d), and the two examples shown in Figure 

5.C.2 in this section, try sketching the following six wave functions for yourself in 
pencil using my drawings in Figure 5.C.3 on the next page. I have already drawn in 
the graph of x = sin r on each of them, to help you. 

(1) x = 2 sin f (2) x = sin 2f (3) x = sin (f/2) (4) x = 1 + sin f 

(5) x = cos f (6) x = cos (f + Jt/2) 

Also, for each wave function, answer the following questions. 

(a) What is its amplitude, A? 

(b) What is its period, 77 

(c) What is its frequency, /? 

(d) Is the function odd or even? 

(e) If co = 2jr/Tfind co in each case. The physical interpretation of co is described 
in the next section. 

Then check your results against the answers in the back of the book. (If 
necessary, draw the graph sketches in again so that you have the right version.) 

5.C.(b) Relating trig functions to how P moves round its circle and SHM 

We can also think about the two functions whose graphs we sketched in Figure 5.C.2(a) and 
(b) in the last section by relating them to the motion of X as P moves round its circle. I 
described this in the thinking point of Section 5.A.(d). We looked there at how the distance 
x = cos t was changing as P moved round the circle with an angular velocity of 1 rad/s. Have 
another look at this thinking point now. 

Can you see how you could draw two similar pictures to show how P would be moving 
to give (a) OX = x = 3 cos t and (b) OX = x = cos 2ft 



x = 3 cos t would be illustrated by the motion of Xif P moves round a circle with a radius 
of 3 units, but still with an angular velocity of 1 rad/s. I show this in Figure 5.C.4(a). As P 
moves round this circle, the distance OX = x varies between the two extremes of +3 and -3 
units, corresponding to the amplitude of 3 in Figure 5.C.2(a). 

x = cos 2t would be illustrated by the motion of X if P moves round a circle of radius one 
unit, but twice as fast, so its angular velocity is 2 rad/s. I show this on Figure 5.C.4(b). 
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Figure 5.C.3 




P oJfr tec ti me t 
I rad/s 

3 when t=o 



x = 3 cos t 




when t=0 



X - Cos at 



Figure 5.C.4 
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In each case, I have shown the displacement x after time t as a thick black line. Because 
these changing displacements are very important in many physical applications, you may 
like to highlight them for yourself in colour in the same way that I suggested you should for 
the four pictures showing the definitions for the sin and cos of angles greater than 90° in 
Section 5.A.(c). 

In both cases, the point X is moving in what is called simple harmonic motion, or SHM. 
'Harmonic' is just another way of saying 'periodic' - used because sound waves are 
produced by combinations of waves of this kind. The word 'simple' is used here because we 
are looking at a motion which can be described by a single cos. 

SHM also describes many other important physical situations. Often these involve an 
object being slightly displaced from its equilibrium position. Examples of this are the 
motion of a weight hung on a spring which is slightly pulled down from its equilibrium 
position, and the motion of a small weight hanging on a long string which is pulled slightly 
to one side and then released so that it moves as a simple pendulum. Again, the 'simple' 
means that the motion can be described in terms of a single cos or sin. 

If a point X moves in SHM it is called a harmonic oscillator. Harmonic oscillators are 
fundamental to the understanding of physical systems. Amazingly, any real-life situation 
involving small vibrations, however complicated it is, can be reduced to a system of 
harmonic oscillators. 

If we write the equation of motion of X as 



x = A cos cot 



then A is the amplitude and to is the constant angular velocity of the point P. 
co is called the angular frequency of the wave described by this equation. 
(co is the Greek letter called omega.) 

In the two examples we have just looked at, we have the following results. 

(1) If x = 3 cos t, then A = 3 and co = 1. We also saw that T = In and/= Win. 

(2) If x = cos It, then A = 1 and co = 2. We also know that T = jt and /= \lit. 

2jt co 
We also have the relations that T = — and /= — . 

co 2ti 



If, in the simplest case described in the thinking point of Section 5A.(d), where P is 
moving round its circle of radius one unit, at a constant angular velocity of 1 rad/s, we had 
looked at the motion of the point Y on the vertical axis instead, we would have had the 
equation for OY of y = sin t (Figure 5.C.5). This is also SHM. Now, when t = 0, y = 0 
also. 

The point 7 is starting from the central position of its motion, unlike X which started from 
its most extreme positive position. 

These circle diagrams make it much easier to see what is happening with more 
complicated sin and cos functions. Such functions are very important in physical 
applications such as describing the voltage and current waveforms in electric circuits. It is 
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^Lj= sin t 



Figure 5.C.5 



much simpler to handle them mathematically through the use of complex numbers and the 
first step in doing this is to become happy with using these circle diagrams. 

I have already drawn for you the examples of x = 3 cos t and x = cos 2t in Figure 5.C.4, 
and y = sin t in Figure 5.C.5. Since I have used x to represent the displacement after time 
/ on all my graph sketches, I shall also use it from now on to show displacements on both 
the horizontal axis of my circle (which gives a cos function), and on the vertical axis of my 
circle (which gives a sin function). 

Here are two more examples showing this kind of relationship. 

example (l) Show the relation of x = 2 sin 3? to the motion of P round its circle. 




, P after time, t 



START 
when t=o 

0J=3 rad /$ 
X= 2. Sin 3t 



Figure 5.C.6 



I show this on Figure 5.C.6. The maximum value of x is 2, therefore 
A = 2, and the radius of the circle must be 2 units. When t = 0, x = 0. After a 
time t, x = 2 sin 3t, so P is moving with an angular velocity of 3 rad/s 
therefore m = 3. A full turn or cycle takes 2jt/3 s so T= 2 ti/3. 

example (2) Show the relation of x = cos (t + jt/6) to the motion of P round its circle. 




V after fcirne t 



START 
when t=0 



co=t rod/s 

X=COSa + *%) Figure 5.C.7 
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I show this on Figure 5.C.7. The maximum value of x is 1, so A = 1 
and the radius of the circle must be 1 unit, x = cos tt/6 when t = 0. 
Notice that x would have been equal to one unit jr/6 s before the instant 
when we took t = 0. After a time of t, x = cos (f + 7i/6). P is moving 
with an angular velocity of 1 rad/s, so a> = LA full turn or cycle takes 
2jt s so T = 2tt. 



Now have a go at these yourself. 

Draw sketches showing the motion of the point P round its circle for each of 
the following: 



(l) x = cos 3f 
(4) x = 4 sin (f/2) 
(7) x = 2 cos (f - jt/6) 



(2) x = 2 sin r 

(5) x = sin (f + jt/6) 

(8) x = 5 sin (3f + jt/6). 



(3) x = 3 cos 2t 

(6) x = sin (2f + Jt/ii) 



Label each sketch in a similar way to my two examples. In each case, you should 
also give the value of the amplitude, A, and of the angular velocity, a>, and of the 
period, T. It is very important to actually do these sketches yourself; don't just 
look at my answers. 



5.C.(c) New shapes from putting together trig functions 

What happens if we add sin t to cos (t + tiI2)1 (Have a look at your sketch for question (6) 
of Exercise 5.C.I.) 

What happens if we add sin t to cos ft Try sketching for yourself what the result would 
be in each case. 



In (6), because cos (t + nil) = -sin t, the result of adding the two waves is always zero. 
They are exactly out of phase with each other. 

I show in Figure 5.C.8 a sketch for x = sin t + cos t drawn from putting together the two 
curves x = cos t and x = sin t and marking in all the easy points such as where one of them 
is equal to zero, or they are equal to each other and so just double, or they are equal but 
opposite in sign and so balance out. 




1 . -J- . — _j . _i 1 1 1 i . 1 • 1 1 

3f §f 3f 5f Of !3f iff j^r 

Figure 5.C.8 
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We see from this sketch that x = sin t + cos t has an amplitude of 2 sin(jr/4) = 2 X l/ v '2 
= y[2, and a period of 2jt. 

It looks as if it might also be sin-shaped. (We shall find out how to show that it is a sin 
curve in Section 5.D.(f).) 



Sketching graphs by hand becomes very time-consuming (and difficult if the functions 
are more complicated), but if you have access to a graph-sketching calculator or computer 
it would be good to see what happens when you add all the pairs of functions in the six 
graphs shown in the answers to Exercise 5.C.I. 

It is also very interesting to see what happens if you add a sequence of sines. You will 
see that the shape of the resulting curve gets successively modified to give some remarkable 
results. 

Here are two examples you could try. 

(I have used the — > symbol here to mean 'put in the next bit of the sequence and see how 
it affects your graph.') 

/ sin 2A / sin 2t sin 3A 

(1) (sin t) -» I sin t - — - — I -» I sin t - — - — + — - — I -> . . . 

/ sin 3A / sin 3t sin 5A 

(2) (sin t) -» I sin t + — — I I sin t + — — + — - — I -> . . . 

The further you go with these sequences the more interestingly modified the shapes of the 
graphs become. 

By this kind of method it is possible to get graphs which are very close approximations 
to the ones shown in Figure 5.C.9, both of which are waveforms which can occur naturally 
in electrical signals. 

If you have done the experiments of (1) and (2) above, you will find that you get 
increasingly good matches except for little overshoots close to the vertical parts of the graph. 




3n 



3n 



-t ■ 

Figure 5.C.9 
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This is called Gibb's phenomenon and it comes from the problems in accurately representing 
a graph which is effectively doing a jump at these points. 

The fact that these functions can be thought of as sums of sines (or, more generally, to 
include other cases, as sums of sines and cosines) is of great practical importance. This 
whole area of what is called harmonic analysis was developed by the French mathematician, 
Fourier. 

Can you see why we couldn't represent any periodic functions just by sums of sines of 
multiples of t as in the two earlier examples I gave you? 



The sums of such sines will always give odd functions. If the function we want to 
represent isn't odd then we shall need also to include cosines of multiples of t to get a correct 
representation of what is happening. 

If the function is made up entirely from cosines of multiples of t it will always be 
even. 

Try the following sequence to see this happening. 

/ cos 3x\ / cos 3x cos 5x\ 

(cos x) — > cos x h — > cos x h 1 — > . . . 

\ 3 2 J \ 3 2 5 2 J 

5.C.(d) Putting together trig functions with different periods 

All the examples of putting trig functions together which we have looked at so far in this 
section have had periods which were the same as at least one of the input functions. For 
example, both sin t and cos t have a period of 2jt and x = sin t + cos t also has a period of 
2n. 

x = sin t + 3 sin 3t + j sin 5t has the period of 2ti belonging to sin t since all the other 
functions neatly sit inside this, (sin 3? has a period of j X 2ji, and sin 5t has a period of 
f X 2jt.) 

What happens if we put together trig functions with different periods? 
For example, suppose we take the case of x = sin(r/4) + sin(?/5). 
sin(r/4) has a period of Sji and sin(^/5) has a period of IOjc. 

The joint period when these two functions are added together, is given by the smallest 
number which both 8tt and IOjt divide into exactly (their l.c.m.), which is 40ji. This is the 
smallest number which can accommodate a whole number of cycles of both functions. 

I show in Figure 5.C. 10(a) a sketch of x = sin(r/4) and x = sin(?/5) on the same axes. 
Underneath that, in Figure 5. C. 10(b), I show a sketch of the joint function, x = sin(r/4) + 
sin(r/5) so that you can see how it comes from the two functions above. 

The complete cycle shown of x = sin(?/4) + sin(f/5) has a more complicated shape than 
its two building functions because, at the beginning and end of the cycle these two functions 
are quite close and so their sum produces roughly twice the displacement. 

Then, because sin(?/5) is changing more slowly, it gets more and more behind sin(r/4). 
This means that around the middle of the cycle the two functions are nearly cancelling each 
other out. 

By the end of the cycle, sin (t/5) has got so far behind that it gets lapped by sin (t/4), and 
the two functions are again close together. 

If the two building functions have periods which are very close together, then the contrast 
between the peaking effect at the two ends of each cycle and the level trough near its centre 
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Figure 5.C.10 



becomes very much more marked. A physical example of this is what happens if two musical 
notes, very close to each other in pitch, are played at the same time. The peaks are heard as 
beats which will disappear when the two notes exactly match. This phenomenon is made use 
of by piano tuners and by other musicians when they tune their instruments. 

5.D Finding rules for combining trig functions 

5.D.(a) How else can we write sin (A + 6)1 

\fA and B are two different angles, is it true that sin (A + B) = sin A + sin Bl 
Test your answer with two examples on your calculator. 



Except for some very special cases, such as when B = 0, it is not true that 
sin (A + B) = sin A + sin B. 

^^^^ Students sometimes write that sin 2A = 2 sin A, for example, but from the 
^H-^V first two questions of Exercise 5. C.I earlier it is clearly obvious that sin It 
and 2 sin t are not at all the same thing. 



Can we find a way of writing sin (A + B) using the sin and cos of A and Bl 

(As we shall see in Section 5.D.(f) it is often important to be able to do this.) 

To show this geometrically, we shall need right-angled triangles to work from. 

We start by drawing the tilted triangle for /LB, as this is the trickiest one to get, and then 
build up the diagram as I show in Figure 5.D.I. 

Then we complete this chain by drawing the triangle RNQ. This is because it gives us 
another right-angled triangle with lengths that we want. ARQN = /LA because NQP is a 
straight line, and so 180°, and the angles of AOQP also add to 180°. 
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Figure 5.D.1 

Then we have: 

RM PQ + QN OQ sin A + QR cos A 

sin (A + B) = = = 

OR OR OR 

OQ QR 

= sin A + cos A = cos B sin A + sin B cos A. 

OR OR 

This is more usually written as 

sin (A + B) = sin A cos B + cos A sin B. 



5.D.(b) A summary of results for similar combinations 

In a very similar way, we can get formulas for sin {A - B), cos {A + B) and cos(^ - B). 
(These can also all be shown to be true for angles larger than 90°.) 
These are listed in the box below: 



sin (.4 


+ B) 


= sin A cos B + cos A sin B, 


sin (A 


-B) 


= sin A cos B - cos A sin B, 


cos (A + B) 


= cos A cos B - sin A sin B, 


cos (A 


-B) 


= cos A cos B + sin A sin B. 
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^^^^ Notice the + and - signs in the middle of the formulas for cos (A + B ) and 
^B-^V cos {A - B). It makes sense that they should be this way round when you 

remember that cos (60° + 30°) = cos 90° = 0 but cos (60° - 30°) = cos 30° = 

V3/2. 



5.D.(c) Finding tan (A + B) and tan (A - B) 

How shall we set about getting a formula for tan(^4 + B)l We can say 

sin (.4 + B) sin A cos B + cos A sin B 

tan (A + B) = = . 

cos (A + B) cos A cos B - sin A sin B 

It would be nicer to have the answer entirely in terms of tan A and tan B. Can you see what 
we need to do to the top and bottom of this fraction to make this possible? 



If we divide top and bottom by cos A cos B, and cancel where possible, we shall get 



tan A + tan B 

tan (A+B) = 



1 - tan A tan B 



(Remember that each of the four separate chunks in the fraction is getting divided.) 
You should now be able to show for yourself that 



tan A - tan B 

tan (A - B) = . 

1 + tan A tan B 



5.D.(d) The rules for sin iA, cos iA and tan iA 

These follow immediately from the previous results, putting B = A. We get: 



sin 2A 


= 2 sin A cos A, 


cos 2A 


= cos 2 A - sin 2 A, 




2tmA 


tan 2A 






1 - tan 2 A ' 



In the case of cos 2A, it is possible to write this rule in two other ways, using the identity 
that sin 2 A + cos 2 A - I. We then get: 



cos 2A 


= cos 2 


A-(\ 


- cos 2 A) = 2 cos 2 A - I, 


cos 2A 


= (1- 


sin 2 A) 


- sin 2 A = I - 2 sin 2 A. 
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We shall find these alternative versions very useful later on in solving trig equations and 
for integrating sin 2 x and cos 2 x. I give you examples of this in Section 5.E.(d) and example 
(4) of Section 9.B.(c). 

5.D.(e) How could we find a formula for sin 3/1? 

We can now find a formula for sin 3A completely in terms of sin A. 

We do it by writing sin 3A as sin (A + 2A) and then using the sin (A + B) formula on this. 
Then we have 

sin 3A = sin (A + 2A ) = sin A cos 2A + cos A sin 2A 

= sin A{\ - 2 sin 2 A) + cos A(2 sin A cos A) 

(using the rules for sin 2A and cos 2A from the section above) 

= sin A - 2 sin 3 A + 2 sin A cos 2 A 

= sin A - 2 sin 3 A + 2 sin A(l - sin 2 A) 

= 3 sin ^4 - 4 sin 3 A. 

You should now be able to find a similar rule for cos 3A in terms of cos A for yourself. 
I have put this pair of rules in the box below for you: 



sin 3A = 3 sin A - 4 sin 3 A, 
cos 3 A = 4 cos 3 ^4-3 cos ^4. 



5.D.(f) Using sin (A + B) to find another way of writing 4 sin t + 3 cos f 

In Section 5.C.(c), we investigated graphically the effect of adding sin t to cos t for each 
value of t. The result seemed to be a sin curve which had been shifted by some angle from 
the origin. 

There are many physical and mathematical situations where it is much easier to deal with 
a single sin or cos function rather than having combinations of such functions. Such 
examples include describing the wave functions for alternating current and voltage, and 
making it easier to solve certain kinds of trig equation as we shall see in Section 5.E.(e). 

I will show you how we can do this conversion to a single function by taking the 
particular example of x = 4 sin t + 3 cos t. 

We start by noticing that 4 sin t + 3 cos t looks a little bit like 

sin A cos B + cos A sin B, which is sin (A + B) 
as we saw in Section 5.D.(a). So we try writing 

4 sin t + 3 cos t = R sin t cos a + R cos t sin a 

which is R sin (t + a). 

(We need to include the R here to avoid getting into the impossible position of needing 
a sin or cos greater than 1 .) 

We now have to find the particular numerical values of R and a which will make this 
equation be true for every value of t, so that each of the two sides is just another way of 
writing the same thing. This means that the equation is an identity and each separate part 
must match up, just as we matched up the separate terms in the identity in Section 
2.D.(h). 
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Here, the two sides will only be equal for every value of t if we have both the same 
quantity of sin t each side, and the same quantity of cos t on each side. 
Matching up the parts with sin t, we get 

4 sin t = R cos a sin t so 4 = R cos a. 

Matching up the parts with cos t, we get 

3 cos t = R sin a cos t so 3 = R sin a. 

The easiest way to find R and a is to draw a picture showing the information we now have. 
I do this here in Figure 5.D.2. 



Using Pythagoras' theorem gives us R 2 = 3 2 + 4 2 = 25 so R = 5. 

3 

We also see that tan a = 4 so a = 0.6435 radians to 4 d.p. 

We can now write x = 4 sin t + 3 cos f in the alternative form of x = 5 sin (f + a) with 
a = 0.6435 to 4 d.p. (I shall continue calling this angle a for short.) 

What will the graph of x = 4 sin t + 3 cos f = 5 sin (t + a) look like? 

(You will find the answer to this question much easier to understand if you did Exercises 
5.C.1 and 5.C.2 in Sections 5.C.(a) and 5.C.(b). If you haven't yet done these, you should 
go back and do them now.) 

To help us to sketch the curve of x = 4 sin t + 3 cos t = 5 sin (t + a), we relate this to 
how the point P moves round its circle. The displacement x will be shown on the vertical axis 
since it is a sin function. I show this below in Figure 5.D.3(a). 

P is moving round its circle of radius 5 units with an angular velocity of one radian per 
second. It starts at the angle a when t = 0. 




d 

R COS et s 4 

Figure 5.D.2 



RSino< = 3 




-f DC 



Figure 5.D.3 
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When it has moved through a further angle of t, the displacement x is given by x = 
5 sin(r + a). 

We can see from the picture that x will increase first to its maximum value of +5 and then 
decrease through zero to -5. 

We can also see that x would have been equal to zero at a or 0.6435 seconds before the 
instant when we are taking t = 0. 

Using this information we can then draw the sin curve x = 5 sin (t + a) shown in Figure 
5.D.3(b). I have also drawn x = 5 sin t, using a dashed line. You can see that we have a gap 
of a between these two graphs. The angle a is called the phase angle or phase. We see that 
x = 5 sin (t + a) leads x = 5 sin t by a seconds. 

For both graphs, the amplitude A = 5, the angular velocity co = 1, and the period T = 

7jt. 

We have just seen that it is possible to write the function x = 4 sin t + 3 cos t 
in the form x = 5 sin(Y + a) with a = 0.6435 radians. 

Would it be possible to combine 4 sin t + 3 cos t to give a single cos function instead, 
and if so which rule should we use? 



It is possible to do this, and we would need to use the rule for cos {A - B) because this 
gives us the plus sign in the middle. Doing this will give us 

3 cos t + 4 sin t = R cos t cos j3 + R sin t sin /? 

which is the same as R cos (t - j3). 

We can see that R will still be equal to 5 here, but I have called the angle (i to avoid 
confusing it with the angle a which we found earlier. 



3- R COSjS 
Figure 5.D.4 

Matching up the separate terms in sin and cos gives us 3 = R cos /3 and 4 = R sin (5. This 
information is shown on the little triangle in Figure 5.D.4. We see that tan (5 = f so (5 = 0.9273 
radians to 4 d.p. We can also see now that a + {3 = jt/2 because a is the top angle in this 
triangle. So we now have the result that x = 4 sin t + 3 cos t can also be written as 
5 cos (t - ft) with /3 = 0.9273 radians to 4 d.p. 

Drawing the circle diagram for x = 5 cos (t - ji) in Figure 5.D.5(a) shows us that we 
have exactly the same displacement x after time t as before. The only difference is that it is 
now being shown on the horizontal axis as a cos function. This shift in position through a 
right angle is the reason why a + j3 = jt/2. 

At time t we have x = 5 cos (t - ji). 

When t = 0, x = 5 cos (-/?) = 5 cos j3 because the cos graph is even (see Section 5.A.(a) 
if necessary). 
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X* 5 cos ( t - p) 



Figure 5.D.5 

When t = /?, x has its maximum value of 5 cos (0) = 5 units. 

The graph for x = 5 cos (t - /S) is, of course, identical to the graph for x = 5 sin (t + a) 
because both represent x = 4 sin t + 3 cos 

I have shown it again in Figure 5.D.5(b) with the graph of x = 5 cos t shown as a dashed 
line. We see that the phase angle is j3 and x = 5 cos (t - p) lags x = 5 cos t by /? seconds. 
The a + together make the ;r/2 shift between x = 5 cos t and x = 5 sin 

Again, A = 5, <w = 1 and T = 2ji for both graphs. 

You can see from Figure 5.D.5(a) that, as P moves round from its starting position, what 
happens first is that x increases in size to its maximum value of 5 units, and this is what the 
graph of x = 5 cos (t - j3) is also doing. 

5.D.(g) More examples of the R sin (f ± a) and R cos (f ± a) forms 

Here is another example, this time involving a minus sign. 

Write x = 3 cos t - 2 sin t as a single trig function and sketch its curve. 

We start by choosing a rule which will fit nicely to what we have this time, including the 
minus sign in the middle. Which rule should we choose? 



cos (A + B) = cos A cos B - sin A sin B will give the kind of fit that we want. 
We write 

3 cos t - 2 sin t = R cos (t + a) = R cos t cos a - R sin t sin a 

so, matching up the separate parts as before, 3 = R cos a and 2 = R sin a. 

Using the little triangle in Figure 5.D.6 shows us that R = ^13 and tan a = § giving 
a = 0.5880 radians to 4 d.p. 




3 = R COSeC 
Figure 5.D.6 
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We can therefore rewrite x = 3 cos t - 2 sin t in the form x = yf\3 cos(? + a) with 
a = 0.5880 radians. 

This can then be related to the way in which P moves round its circle which I show in 
Figure 5.D.7(a). 




x*Jt3 cos(fc+«0 



Figure 5.D.7 

After time t, the displacement x is given by x = /l3 cos(? + a). 
When t = 0, x = ^13 cos a. 

When t = - a (that is, a seconds before the instant at which we are taking t = 0), x will 
have its maximum size of /T3 cos(0) = -Jl~3. 
When t = 7t/2-a, x= Jl3 cos(^r/2) = 0. 

We can now sketch the graph of x = y 13 cos (t + a). I show this in Figure 5.D.7(b), with 
the graph of x = y[l3 cos t shown as a dashed line. The phase angle is a and x = 
^13 cos (t + a) leads x = ^f\3 cos t by a seconds. 

For both the graphs, we have A = ^13, co = 1 and T = 2ji. 

Each of the circle diagrams which we have drawn shows very nicely how its related graph 
works. (It's very easy to see on the circle diagram just what effect the shift given by the angle 
a is having.) But you may be thinking that it is just being perverse to measure time in such 
a way that we get these shifts to worry about. Surely in the real world we can choose to have 
t = 0 when a = 0? 

Not necessarily so! There are some physical situations where we have to deal with waves 
which are out of phase with each other. For example, if we are working with the functions 
which describe how the voltage and current in an alternating current (a.c.) circuit change 
with time, and if this circuit includes components with inductance or capacitance, the 
current will peak after the voltage does, and so the two wave functions describing them will 
be out of phase with each other. 

I'll now give you an example which involves functions of 2t instead of t. We'll combine 
x = 3 sin It + cos 2t into a single trig function and sketch its graph. 

How can we write 3 sin 2t + cos 2i using one of the rules for combined angles? 



We could say either 

3 sin 2t + cos 2t = R sin (2t + a) = R sin 2t cos a + R cos 2t sin a 
or cos 2t + 3 sin 2t = R cos (2t - ft) = R cos 2t cos j3 + R sin 2t sin (5. 
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I shall work with the first of these, but the second would of course give an identical curve. 
We have 

x = 3 sin 2t + cos 2t = R sin 2t cos a + R cos 2t sin a. 

(Notice that everything here is in terms of 2t instead of t.) 
Now, matching up the separate parts, we have 3 = R cos a and 1 = R sin a. 
Drawing the little triangle in Figure 5.D.8 shows us that R = /To and tan a = \ so 
a = 0.3218 rads to 4 d.p. 

\s R Sir* 

3 = R COS * Figure 5.D.8 

This gives us 

x = 3 sin 2t + cos 2t = /lO sin (2t + a) 

with a = 0.3218 radians to 4 d.p. 

We now know that when t = 0, x = VTo sin a and, when 2t + a = ji/2, x = /To sin (jt/2) 
= /To. This happens when t = 5 (pt/2 - a) = 0.624 seconds to 3 d.p. 

As usual, we shall need the circle picture to help us to draw the graph. I show this in 
Figure 5.D.9(a) below. We shall also use these two diagrams in Section 9.C.(c) when we look 
at some differential equations which describe SHM. 

This time, P is moving at 2rad/s. 




Figure 5.D.9 



^^^^ From the circle picture, we can see that we shall have to be very careful 
^B-^V about labelling the interesting points on the graph sketch this time. 

P is moving at 2rad/s so the period of the function is Jt seconds. (Each cycle takes it 
seconds.) Because it is moving at 2rad/s it would have been at the point A at a/2 seconds 
before the instant when we took t = 0. 
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We also know that x has its first maximum value of /To after j i 31 ^ ~ a ) seconds. 

Using this information, I have drawn the function x = VTo sin (It + a) in Figure 5.D.9(b). 
I've also sketched x = y 10 sin 2t using a dashed line. 

The phase angle is a and we see that x = /To sin {It + a) leads x = yTo sin It by all 
seconds. 

For each graph, A = y 10, co = 2 and T = 2nl2 = it. 

exercise 5.D.1 Now try the following questions yourself. Give all your angles in radians, either 

exactly or to 3 d.p. 

For each question, you should also draw a diagram showing the related motion 
of P round its circle. Then use this to sketch the graph of the single combined trig 
function which you have found, in the same way that I have done in my examples. 
Make sure that you label your diagrams clearly, and then use them to write down 
the values of A (the amplitude), a (the angular velocity) and T (the period), of 
each of your combined trig functions. 

(1) Find x = v 3 cos r - sin f in the form x = R cos(r + a). 

(2) Find x = 5 cos f + 12 sin f in the form x = R cos(f - a). 

(3) By choosing a suitable formula, find x = 15 cos f - 8 sin f as a single 
combined trig function. 

(4) By choosing a suitable formula, find x = 2 cos f - 3 sin r as a single 
combined trig function. 

(5) Find x = cos 4f - sin 4f in the form R cos(4f + a). 

(6) Write v 3 sin 3f - cos 3f in the form R sin(3f - a). 

5.D.(h) Going back the other way - the Factor Formulas 

We can use the formulas for sin(^ + B) and sin(^4 - B) to find a useful new way of writing 
the sum of the sines of two angles. 

If we call the two angles P and Q, then we shall find another way of writing sin P + sin Q. 

This is how we do it. We know 

sin (A + B) = sin A cos B + cos A sin B, 

sin (A - B) = sin A cos B - cos A sin B. 

Adding these two equations gives 

sin (A + B) + sin (A - B) = 2 sin A cos B. 

What we actually want is a formula for sin P + sin Q. How can we choose P and Q so that 
they match up with what we have just got? 



We need to put P = A + B and Q=A - B. Then we have 

P+Q P-Q 

P + Q = 2A so A = and P - Q = 2B so B = 

2 2 

This gives us the result 



f P + Q\ (P-Q 
sin P + sin Q = 2 sin I cos 
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Similarly, it can be shown that 



sin P - sin Q = 
cos P + cos Q 
cos P - cos Q 



^^^^ Notice the minus sign at the start of the rule for cos P - cos 0. 

^K^F You can see that it must be there if you put LP = 60° and LQ = 30°, for 
example, cos 60° is smaller than cos 30°, but sin 45° and sin 15° are both 
positive. 



It is sometimes useful to be able to make use of the midway steps for each of these. 
We found in the working above that sin (A + B) + sin (A - B) = 2 sin A cos B. 
The three rules like this one, put together in a box, are: 



2 sin A cos B = 


sin(J + B) + 


sin (.4 - 


B), 


2 sin A sin B = 


cos (A -B)- 


cos (.4 4 


B), 


2 cos A cos B = 


-- cos(A + B) 4 


- cos (A 


-B). 



These two sets of rules are useful to turn adding into multiplying to make it easier to 
solve certain types of trig equation. I show you an example of this in Section 5.E.(d). They 
are also useful the other way round, when they turn multiplying into adding, for certain kinds 
of integral. Example (8) in Section 9.B.(f) shows you how this works. 

We have now obtained all the basic trig rules involving two angles, and so have them 
ready for use whenever we need them. 

You might find it helpful now to go through the previous sections highlighting in colour 
all the boxes with these rules inside, so that you can quickly find them when you need them, 
and can become familiar with them. 

5.E Solving trig equations 

5.E.(a) Laying some useful foundations 

Quite often, students don't like solving trig equations because they find the possibilities of 
more than one answer confusing. It's in the nature of trig equations that they will have an 
infinite number of solutions - we only need to look at the repeating graphs of v = sin x and 
y = cos x to see this. (Of course, physical circumstances may limit the number of possible 
answers; for example, any angle in a triangle must be somewhere between 0° and 180°.) 



(P+Q\ (P-Q\ 
: [—j c ° s I" J' 

(P + Q\ , (P-Q\ 
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When infinite numbers of answers are possible, we shall use the patterns of how they 
come to describe them. To do this, we shall need the circle definitions for the trig ratios of 
angles greater than 90° of Section 5.A.(c). I think you will find that it will help you here if 
you read through this section again before going on. Then do the following exercise which 
is based on the results of this section, and which will also give you some particular values 
which will be useful for solving equations. 

exercise 5.E.1 The table below is very similar to the one I gave you for Exercise 5.A.1 in Section 

5.A.(a) except that I have only included positive angles here, and I have put in a 
line for the tan of the angles, too. In that exercise, you worked out the values for 
the sin and cos of the extra angles by using the graphs of y = sin x and y = cos x. 

Try filling in the blanks again by thinking how each angle will come in the 
turning circle, and then matching it up with an angle for which I've given you the 
sin, cos and tan. The values for your angle will then be the same as these except 
for a change of sign in some cases. 

Write your answers in the same form that mine are given in, including v signs if 
necessary, because you will find when you use these results that exact answers 
are often easier to work with than strings of decimals. Then check that your 
answers are right by using your calculator. (It's best to use pencil until you have 
checked!) 



Angles 




71 


it 


71 




2.71 


3jt 


5jt 


ji 


7tc 


5jt 


4tc 


3tc 


5JT 


7jt 




2JI 


(radians) 


0 


6 




3 


2 


T 


T 




~6 


T 


T 


2 


T 


T 


~6~ 


Degrees 


0 


30 


45 


60 


90 


120 


135 


150 


180 


210 


225 


240 


270 


300 


315 


330 


360 


sin 


0 


1 
2 


1 

V'2 


v'3 
2 


1 


























cos 


1 


v'3 
2 


1 


1 

2 


0 


























tan 


0 


1 

71 


1 


v3 


u 



























U stands for 'undefined'. 



We can now start solving trig equations by using the patterns of how these solutions come 
to give us a way of describing the infinite number of possible answers. This is called giving 
the general solution. 

The easiest way for me to explain how to do this is for us to work through some particular 
examples together. I shall take separate examples for sin, cos and tan with one positive and 
one negative value in each case, so that we cover all the possibilities. Then we shall use these 
to build up the rules for the general solutions for each particular case. 

When we solve trig equations, we are working back from the sin, cos or tan of the angle 
to the angle itself. This means that we shall have to use the inverse functions of sin -1 , cos -1 
and tan -1 (or arcsin, arccos and arctan as they are sometimes known). If you are unsure about 
these, you should go back now to Sections 5.A(f), (g), (h) and (i) to see how they work. 
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The angle given by your calculator from a known sin, cos or tan is the angle given by 
using the inverse function. (Remember that a function gives just one possible result for every 
value fed into it.) We know that for any particular value of sin, cos or tan, there are an 
infinite number of possible matching angles. 



The angle given by using a trig inverse function is called the principal value. 



For example, if sin x = 5, then the principal value for the angle x in radians is 7t/6. This 
is what sin" 1 (5) gives you. But other possible solutions to the equation sin x = 5 are the 
angles 5tt/6, 13jt/6, 17jt/6, etc. and there are an infinite number of these. 

5.E.(b) Finding solutions for equations in cos x 

I am starting with cos x because this is the easiest one to write down the patterns for. We'll 
solve the equation 6 cos 2 x - cos x - 1 = 0 

(a) for the principal values, 

(b) for all angles between 0° and 360°, 

(c) for all possible angles, giving the answers in degrees. 

This is just a quadratic equation like the ones we worked with in Chapter 2. If you like, you can 
put cos x =y in the equation, which then gives you 6y 2 - y - 1 = 0. This factorises to give 

(2y - l)(3j + 1) = 0 or (2 cos x - 1)(3 cos x + 1) = 0 

replacing y by cos x. You can also factorise straight to this form without bothering with the 
y if you like. 

From this, there are two possible solutions for cos x. 

Either 2 cos x - 1 = 0 so cos x = \ and the principal value of x is 60°, or 
3 cos x + 1 = 0 so cos x = -| and the principal value of x is 109.5° to 1 d.p. (This 
answer is 109.47 to 2 d.p. and I'll use this in any further working to avoid rounding errors.) 
These two angles give us the answer to (a). 

Now we answer (b) by finding all the solutions of the equation between 0° and 360°. 
It's easiest to see where these must be if we use the two circle diagrams of Figure 5.E.I. 
From Figure 5.E.l(a) we get a second possible solution of 360° - 60° = 300°. 
From Figure 5.E.l(b) we get a second possible solution of 360°- 109.47 = 250.5° to 1 d.p. 
Use your calculator to check that x = 300° and x = 250.5 do fit the equation which we 
started with. 




Figure 5.E.1 
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(c) Now we want to find all the possible solutions to the given equation. 

Looking at the two circle diagrams of Figure 5.E.1, we can see that each pair of answers 
is symmetrically placed either side of the horizontal axis. 

Adding any number of full turns to each of the four solutions we already have will give 
further possible solutions. 

We can show all these further solutions by writing the ones which we already have in the 
form 

x = 360°« ± 60° and x = 360°« ± 109.5° 

where n is any whole number. (Remember that '±' means 'plus or minus'.) 

The answers which we already have for (ii) could have been found by putting n = 0 and 
n = 1 in the two general solutions above and then picking out the ones which come between 
0° and 360°. (Try doing this for yourself.) 

You can also see that these answers agree entirely with what happens if you use the graph 
of cos x, by looking at Figure 5.E.2. The answers are given here by the x values at the 
intersections of y = cos x with the two lines y = j and y = -3. 

We have now seen that the two sets of general solutions are given by 

x = 360« ± (the principal value in degrees) 
and that this was true whether the principal value was positive or negative. 




Figure 5.E.2 



These are the rules which we now have. 



Finding all possible solutions for the angles from a given cos 

You must decide whether you are working in degrees or radians before you start. 

• If cos x = a, first find cos -1 a on your calculator. 
cos -1 a is called the principal value for the angle. 

• If you are working in degrees, all the possible values are then given by 

x = 360°« ± (the principal value in degrees). 

• If you are working in radians, all the possible values are then given by 

x = 2jin ± (the principal value in radians), 
where n is any whole number. 

This is called the general solution of the equation cos x = a. 
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^^^^ Never give a mixed answer like x = 2rm ± 60° because this is meaningless. 
^H-^V You must work completely either in degrees or in radians. (If you need help 
with radians, see Section 4.D.) 



exercise 5.E.2 Try solving the similar equation 2 cos 2 x + 3 cos x + 1 = 0 for yourself, 

(a) for the principal values, (giving your answers in degrees), 

(b) for all angles between o° and 360 0 , 

(c) for all possible angles, that is, the general solution. 

5.E.(c) Finding solutions for equations in tan x 

We'll use the following example to show how this is done. 
Solve the equation sec 2 x - tan x - 3 = 0 

(a) for the principal values, 

(b) for all angles between 0° and 360°, 

(c) for all possible angles. 

We have a difficulty here which is that this equation is partly in terms of sec x and partly 
in terms of tan x, and we can't do anything with it as it stands. But we found earlier a 
relationship between sec x and tan x which we can use here. 

Can you remember what it is? 



We can use the identity tan 2 x + 1 = sec 2 x (Section 5.B.(b)). 
Substituting for sec 2 x using this, we now have 

(tan 2 x + 1) - tan x - 3 = 0 so tan 2 x - tan x - 2 = 0 

so (tan x - 2)(tan x + 1) = 0. 

(a) Either tan x - 2 = 0 so tan x = 2 and the principal value of x is 63.43 = 63.4° to 
1 d.p., or tan x + 1 = 0 so tan x = -1 and the principal value of x is -45°. 

(b) Now we want all the solutions between 0° and 360°. 

Using the definition for the tan of an angle greater than 90° from Section 
5.A.(c), we can see where the other two solutions between 0° and 360° must be. 

Figure 5.E.3(a) shows the two solutions of tanx = 2, and Figure 5. E. 3(b) shows 
the two solutions of tan x = -1 between 0° and 360°. 




Figure 5.E.3 
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(c) Adding any number of full turns to the solutions above will give all the possible 
solutions. 

Can you see what pattern these will have? Look particularly at what happens 
after any number of half turns. 



This time, the principal value is always added on to however many half turns have been 
made. 

This adding on takes into account the sign of the principal value, so 135° = 180° + 
(-45°), for example. 

The general solution is given by x = 1 80°« + 63 .4 and x = 1 S0°n - 45°, where n is a whole 
number (or integer). 

You can see how these solutions will also work graphically by looking at Figure 5.E.4 
below. 




Solutions between O* and 360° ^ 

Figure 5.E.4 

The solutions are given by the x values at the intersections of y = tan x with the two lines 
y = 2 and y = -1. 

These are the rules which we now have. 



Finding all possible solutions for the angles from a given tan 

• If tan x = a, first find tan -1 a on your calculator. 
tan -1 a is the principal value for the angle. 

• If you are working in degrees, all the possible values are then given by 

x = 180°« + (the principal value in degrees). 

• If you are working in radians, all the possible values are then given by 

x = nil + (the principal value in radians) 

where n is any whole number. 

(You must include the sign of the principal value in these rules.) 
This is called the general solution of the equation tan x = a. 
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exercise 5.E.3 Try solving the similar equation of sec 2 x + 2 tan x - 4 = o for yourself 

(a) for the principal values, giving your answers in degrees, 

(b) for all angles between o° and 360 0 , 

(c) for all possible angles, that is, the general solution. 

5.E.(d) Finding solutions for equations in sin x 

We'll use the example of solving the equation 1 + 3 sin x - 5 cos 2x = 0 

(a) for the principal values, 

(b) for all angles between 0° and 360°, 

(c) for all possible angles, giving the answers in degrees. 

Again we have a mixed equation. We need to use a trig identity so that we can write it 
just in terms of sin x. 

How else can we write cos 2x1 



We can say that cos 2x = 1 - 2 sin 2 x from Section 5.D.(d). 
Substituting this in the equation gives us 

1 + 3 sin x - 5 (1 - 2 sin 2 x) = 0. 

From this we get 

10 sin 2 x + 3 sin x - 4 = 0 so (2 sin x - 1) (5 sin x + 4) = 0. 

1 4 

(a) Either sin x = j, which gives the principal value of x = 30°, or sin x = - 5, which 

gives the principal value of x = -53.13° = -53.1° to 1 d.p. 

(b) All the possible solutions between 0° and 360° can be seen from the two circle 
diagrams in Figure 5.E.5. 




Figure 5.E.5 

Circle (a) gives us 30° and 180° - 30° = 150°. 

Circle (b) gives us 360° - 53.13° = 306.9° to 1 d.p. and 180° + 53.13° = 233.1° 
to 1 d.p. 

(c) The pattern for getting all the possible solutions is a little bit harder to spot this 
time as the principal value is sometimes being added on and sometimes being 
taken off. Can you see how to describe this pattern? It might help you if you think 
about the number of half turns involved as you get to each new solution. 



5.E Solving trig equations 



221 



We know that all the possible solutions will be given by adding any number of full turns 
to the four solutions which we already have. 

If we look at Figure 5.E.5(a) first, this gives 360°« + 30° and 360°« + 180° - 30°. 

Now 360°« = 2 X 180°«, so we can write these two answers as 2 X 180°n + 30° and 
2 X \S0°n + 180° - 30°. 

This is the same as In (180°) + 30° and (In + 1) 180° - 30°. 

If the number of half turns is even, we add on the 30°. 

If the number of half turns is odd, we take off the 30°. 

These two results can be ingeniously combined by using (-1)", because (-1)" gives us +1 
if n is even and -1 if n is odd. 

All the possible solutions from sin x = \ are given by x = lS0°n + (-1)" 30°. (The two 
solutions of (b) are given by putting n = 0 and n = 1 .) 

4 

In just the same way, all the possible solutions of sin x = - 5 are given by writing 
x = 180°n + (-1)" (-53.1°). 

You can also see how these solutions are building up in the sketch graph of Figure 5.E.6. 
They are given by the x values at every intersection of the curve of y = sin x with the two 
lines y = 2 and y = -5 respectively. 




4 Solutions between O" cmd 360*' 



Figure 5.E.6 

The box below gives the rules which we have now found. 



Finding all possible solutions for the angles from a given sin 

• If sin x = a, first find sin -1 a on your calculator. 
sin -1 a is called the principal value for the angle. 

• If you are working in degrees, all the possible values are then given by 

x = 180°« + (-1)" (the principal value in degrees). 

• If you are working in radians, all the possible values are then given by 

x = Jin + (-1)" (the principal value in radians), 
where n is any whole number. 

(You must include the sign of the principal value in this rule.) 
This is called the general solution of the equation sin x = a. 



222 



Extending trigonometry 



exercise 5.E.4 Try solving the equation cos 2 x + 2 sin x = 1 for yourself 

(a) for the principal values (giving your answers in radians), 

(b) for all angles from 0 to 2jt, 

(c) for all possible angles, that is, the general solution. 

I will finish this section with an example of a slightly different kind of equation involving 
sin x. 

Suppose we need to solve sin 3x = sin x for angles between 0 and 2jt. 

See how far you can get with this yourself before looking at what I have done. 



It's easy to spot that x = 0 is one solution of this equation, but how can we set about 
finding the others? 

Figure 5.E.7 shows a snapshot of what's happening graphically. 




Figure 5.E.7 



We can now see that x = jt and x = 2n will also fit, but what values of x will give the other 
four solutions? 

We have sin 3x = sin x so sin 3x - sin x = 0. 

Now we use the second of the four factor formulas from Section 5.D.(h) 

(P+Q\ (P-Q\ 

sin P - sin Q = 2 cos sin 

I 2 J \ 2 J 

and put 3x = P and x = Q. This gives us 

2 cos(2x) sin x = 0 so sin x = 0 or cos 2x = 0. 
From sin x = 0 we get x = 0 or 71 or 2ji. 

From cos 2x = 0 we get 2x = 2rm ± 7tl2 so x = rm ± jz/4, giving us the other four 
solutions of x = jt/4, 3jt/4, 5tt/4 and 7jt/4. 

There is often more than one possible method for solving these equations. For example, 
we could have done this one by writing sin 3x = 3 sin x - 4 sin 3 x from Section 5.D.(e) and 
then factorising. Also, in the method above, when we had cos 2x = 0 we could have used 
cos 2x = 1 - 2 sin x, giving sin x = j so sin x = ± — . Sometimes one method is neater 
than another, but there is no magic 'right way'. 



5.E Solving trig equations 



223 



exercise 5.E.5 Try solving the following equations which use the whole of Section 5.E so far. 

In each case, find (a) the principal value(s), (b) solutions for o° < x < 360 0 or 
o < x < 2jt (I give the units after each question), and (c) the general solution. (Give 
your answers correct to 1 d.p. for degrees and 2 d.p. for radians.) 



I think it is much easier to use the general solutions to find the answers 

between 0° and 360° or 0 and 2jt. You just need to put in the values for n 

which give the answers in the desired range. I suggest you try doing this. 

2 1 
(1) cos x = 3 (deg) (2) tan x = 5 (deg) (3) cos x = -5 (rad) 

(4) tan x = -1 (rad) (5) sin x = 0.4 (deg) (6) 6 sin 2 x + 5 cos x = 7 (rad) 

(7) tan 2 x = tan x (rad) (8) 3 sec 2 x + tan 2 x = 5 (deg) 

(9) sin 2x = 3 cos x (rad) (10) sin 5x + sin x = 0 (deg) 




5.E.(e) Solving equations using R sin (x + a) etc 

What should you do if you meet a problem like the following one? 

Solve, when possible, for angles between 0° and 360°, the three equations 

(1) 4 sin x + 3 cos x = 6, 

(2) 4 sin x + 3 cos x = 5, 

(3) 4 sin x + 3 cos x = 2. 

It is not difficult to do this if we use the results of Section 5.D.(f). 

We showed there that we can write 4 sin t + 3 cos t in the form 5 sin (/ + a) with 
a = tan 1 4. (The only differences here are that we have x instead of t, and that we are 
working in degrees instead of radians, so a = 36.87° to 2 d.p.) 

If you are at all unsure about this, you should go back now to Sections 5.D.(f) and (g), 
and work through them before going any further. Then see if you can solve the three 
equations yourself. 



This is what I hope you have found. 

(1) There is no possible solution here. We can see this in two ways. 

Firstly, if 5 sin (x + a) = 6 then sin (x + a) = f which is impossible. 

You can also see this by looking at the graph of y = 5 sin (x + a) which I have 
sketched in Figure 5.E.8. 

You can see here that the line y = 6 misses this sine curve completely, so there 
are no solutions to the equation. 

(2) Again, we can look at this in two ways. 

We have 5 sin (x + a) = 5 which gives sin (x + a) = 1 , so the principal value 
of (x + a) is 90°. 

From this, we can say that (x + a) = 180°« + (-1)" 90° using the rule for the 
general solution from Section 5.E.(d). 

This then gives us x = 180°« + (-1)" 90° - a. 

Putting a = 36.87 gives us the single solution between 0° and 360° of x = 53.1° 
to 1 d.p. 
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Figure 5.E.8 

This answer fits with what we can see is happening graphically. The line y = 5 
is a tangent to the curve y = 5 sin (x + a), and only touches it once between x = 
0° and x = 360°. 

2 

(3) Now we have 5 sin (x + a) = 2 so sin (x + a) = 5 which gives the principal value 
of (x + a) as 23.58° to 2 d.p. 

Therefore, the general solution for (x + a) is given by 180°« + (-1)" (23.58°) 
or x + 36.87° = 180°« + (-1)" (23.58°), putting a = 36.87°. 

Putting n = 0 gives x = - 13.3°, n = 1 gives x= 119.6° and n = 2 gives x = 346.7° 
all to 1 d.p. 

You can see all three of these answers on the sketch graph in Figure 5.E.8. The last two 
of them give the solutions in the range from 0° to 360° that we want. 

Notice that the answers given by the general solution for (3) are symmetrically placed 
either side of the answers for (2), and that all these answers have been affected by the sliding 
along to the left by a of the graph of y = 5 sin x to give y = 5 sin (x + a). 

^^^^ The most usual mistake made when solving this sort of equation goes as 

The solver gets to x + a = 23.58° correctly and then rearranges this to 
get the correct answer for x of -13.3°. 

Then they think 'Curses, I needed a general solution here! Oh well, I'll 
putx = 180°« + (-1)" (-13.3°).' 

This is not true! The general solution comes from using the graph of 
y = 5 sin (x + a) and the solutions must be found taking the whole of 
(x + a) as I have done. 



exercise 5.E.6 Try these two for yourself now. 

(1) Solve, when possible, the three equations 

(a) 3 cos f - 2 sin r = 4, 

(b) 3 cos f - 2 sin f = /13, 

(c) 3 cos f - 2 sin f = 1 for 0 < f < 2jt giving your answers to 2 d.p. 
Show your answers on a sketch graph. 

(2) Solve the equation 3 sin 2f + cos 2t = 2 for angles between o° and 360 0 . 
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Sequences and series 



In this chapter we look at different patterns in sequences of numbers, and how they 
might be described. We discover how it is possible to find the sum of the terms of 
some of these sequences, and find some practical applications of these sums. We 
begin to see how infinite quantities of things behave through looking at what 
happens if we have very large numbers of them. Endless quantities of things have 
to be treated with great caution, so I show you some examples of what can happen 
otherwise. 

The chapter is divided into the following sections. 

6.A Patterns and formulas 

(a) Finding patterns in sequences of numbers, 

(b) How to describe number patterns mathematically 

6.B Arithmetic progressions (APs) 

(a) What are arithmetic progressions? (b) Finding a rule for summing APs, 

(c) The arithmetic mean or 'average', (d) Solving a typical problem, 
(e) A summary of the results for APs 

6.C Geometric progressions (GPs) 

(a) What are geometric progressions? (b) Summing geometric progressions, 

(c) The sum to infinity of a GP, (d) What do 'convergent' and 'divergent' mean? 

(e) More examples using GPs; chain letters, (f) A summary of the results for GPs, 

(g) Recurring decimals, and writing them as fractions, 

(h) Compound interest: a faster way of getting rich, (i) The geometric mean, 
0) Comparing arithmetic and geometric means, 

(k) Thinking point: what is the fate of the frog down the well? 

6.D A compact way of writing sums: the 2 notation 

(a) What does 2 stand for? (b) Unpacking the 2s, 

(c) Summing by breaking down to simpler series 

6.E Partial fractions 

(a) Introducing partial fractions for summing series, 

(b) General rules for using partial fractions, (c) The cover-up rule, 

(d) Coping with possible complications 

6.F The fate of the frog down the well 



6.A Patterns and formulas 

6.A.(a) Finding patterns in sequences of numbers 

We shall start by looking at some lists of numbers for which there is an underlying pattern 
so that there is some rule for writing down the next number. A list of numbers like this is 
called a sequence. A particular number from a sequence is called a term of the 
sequence. 

Here are some examples. In each case, see if you can fill in the next three terms in the 
sequence, and write down the rule that you are using so that somebody else could continue 
filling in where you have stopped. 
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(a) 
(c) 
(e) 
(g) 
(i) 

GO 



1,2,3,4, 5,.. 
2, 5, 8, 11, 14, 
1,2, 4, 7, 11,. 



l l j_ j_ 

3, 6, 12, 24' • • • 



1, 4, 9, 16, 25, 
1, 8, 27, 64,.. 



(b) 
(d) 
(f) 
(h) 

0) 

(1) 



12 3 4 
2, 3> 4> 5' • • • 



1, 3, 5, 7, 9,.. . 
1,2, 4,8,... 
54, 18, 6,2,|,... 



1,2, 3, 5, 8, 13,21,... 
1, 2, 6, 24, 120,. . . 



Here are the answers for you to check yours against. 

(a) 6, 7, 8. The counting numbers, or add 1 each time. 

(b) 11, 13, 15. The odd numbers. Add 2 each time, starting from 1. 

(c) 17, 20, 23. Add 3 each time, starting from 2. 

(d) 16, 32, 64. Double each time, starting from 1. 

(e) 16, 22, 29. Start by adding 1 to the first term, which is itself 1. Then, for each new 
term, add 2, 3, 4, etc. so that the number you add is always 1 more than the 
previous number added. 

2 2 2 

(f) 9, 27, gj. Take one third of the previous term each time, starting with 54. 

(g) ?8' % » 192- Take one half of the previous term, starting from one third. 

(h) | , 7, g. For each new term, add 1 to both the top and the bottom of the fraction 
which makes the previous term. 

(i) 36, 49, 64. This sequence is formed from the squares of the counting numbers, 
(j) 34, 55, 89. After the first two terms, each term is made by adding the previous two 

terms. This is called a Fibonacci sequence, 
(k) 125, 216, 343. These terms are the cubes of the counting numbers. 
(1) 720, 5040, 40 320. The terms of this sequence are formed by finding 1, 2 X 1, 

3X2X1, etc. They are called factorials, and are written as 1!, 2!, 3!, etc. 



It is often useful to be able to write down a rule or formula which will tell us how to find 
any term we want in a sequence of numbers such as the ones above. To be able do this, we 
shall need a shorthand system for labelling the terms. We will use the system of calling them 
«!, u 2 , u 3 , . . . so that u 4 for (b) is 7, and u 5 for (e) is 11. If we don't want to specify a 
particular number, we can call the term u„ where n is standing for any number which we 
might later want to choose. We call u n the general term. 



What we now want to do is to find some way of writing a rule which gives the general 
term or u n for each of the sequences from (a) to (1). 

The easiest way of explaining how we can set about doing this is to take two particular 
examples. 



6.A.(b) 



How to describe number patterns mathematically 




The n in u n is called a subscript and is just a label telling us how far we 
have gone. Don't confuse it with u" which means u multiplied by itself 
n times. 
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example (1) Sequence (c) goes 2, 5, 8, 11, 14,. 



The description in words for this was 'add 3 each time, starting from 2.' 

There are two ways in which we can write this mathematically. 

We can say u x = 2, u 2 = 2 + 3, u 3 = 2 + (2 X 3), w 4 = 2 + (3 X 3) 
and so on, so that we are describing each term using the actual numbers 
which make it up. We'll call this description (A). 

Sticking to the same system, how would you write m 7 ? How would 
you write w„? 



u 7 = 2 + (6 X 3) and u„ = 2 + ((« - 1) X 3) = 2 + 3n - 3 = 3n - 1. 

Notice that we needed (n - 1) rather than n when we first wrote down the rule for u n . We 
can check this rule by testing it when n = 5. We get 3X5-1 = 14 which we know is 
correct. 

We could also think of this sequence as building up in a chain, each new term coming 
from the previous term according to a particular rule. We'll call this description (B). 

Description (B) for this sequence would be u n = u n _ 1 + 3. But just knowing this would 
not be enough, because, for example, the sequence 1, 4, 7, 10, 13, . . . would also fit this 
description. However, if we also give the value of the first term, the sequence is fully 
described. 

Description (B) is u n = u n _ Y + 3 and u 1 = 2. 

example (2) Sequence (g) goes 5, \, n, ji- . . . 

The description in words for this was 'take one half of the previous 
term starting from one third'. 

Description (A) We can say that u x = f, u 2 = \ X \, w 3 = \ X \ X | = (|) 2 X \ 

so w 7 , say, is (j) 6 X 3 and u n = (2)" 1 x 3- 

Notice that we need a power of n - 1 here to make u n work 
correctly, not n. 

Description (B) We can say that u n = \u n _\ and u x = \. 

Just as in the last example, if we don't say what u x is, we could 
get quite a different sequence. For example, the sequence 24, 12, 
6, 3, . . . also fits the description 

Sometimes both these methods of description are useful when we are considering 
particular sequences. Sometimes one is very much easier to find than the other. 

exercise 6.A.1 Try finding the following descriptions for yourself now. Keep a special eye out for 

sequences which can be described in a similar way to each other because we shall 
be looking at some of these in more detail in the next two sections. 

(1) Find descriptions (A) and (B) for sequence (a) on page 225. 

(2) Find descriptions (A) and (B) for sequence (b). 

(3) Find descriptions (A) and (B) for sequence (d). 

(4) Find just description (B) for sequence (e). 

(5) Find both descriptions (A) and (B) for sequence (f). 

(6) Find just description (A) for sequence (h). 

228 Sequences and series 



(7) Find just description (A) for sequence (i). 

(8) Find just description (B) for sequence (j)- 

(9) Find just description (A) for sequence (k). 

(10) Find both descriptions (A) and (B) for sequence (I). 



I am giving the answers to this exercise here as we shall be needing some of them in the 
next two sections. 

(1) Description (A) for sequence (a) gives u n = n and description (B) gives 
u„ = u„_ x + 1 with W| = 1. 

(2) For description (A) for sequence (b), we can say that each odd number is one behind 
the corresponding term in the sequence of even numbers, so u„ = 2n - 1 . 



It is useful to remember this as a formula which must give an odd number. 
Similarly, 2n + 1 must also always be an odd number, while In is always 
even. 



Description (B) for this sequence says u n = u n _ l + 2, with u x = 1. 

(3) Description (A) for sequence (d) is u 2 = 2 X 1 and u 3 = 2 2 X 1 etc. 
so u n = 2"- 1 X 1 = 2"-\ 

For description (B) we have u„ = 2u„_ x with u x = 1. 

(4) Description (B) for sequence (e) is u„ = u„_ x + (n - 1) with u x = 1, or you could 
write this asw„ + 1 = H„ + /j with u x = 1. 

It is quite difficult to find a formula for u„ in terms of n here, just by looking 
at the terms, which is why I didn't ask you to do it. 

In fact, the rule for (A) is u n = ji 1 - j n + 1- Check for yourself that this works 
for n = 1,2 and 3 . 

(5) For sequence (f), if we write u 2 = 18 = (|) 54, and u 3 = 6 = (f) 2 54, we see that 
u n = 54, so this is description (A). 

Notice, here, that the first term uses = 1, which is one of the rules from 
Section l.D.(b). 

Description (B) is u n = \{u„_ x ) with u x = 54. 

n 

(6) Description (A) for sequence (h) is u n = . 

n + 1 

(7) Description (A) for sequence (i) is u n = n 2 . 

(8) Description (B) for sequence (j) is u n = u n _ x + u n _ 2 with u x = 1 and u 2 = 2. 

The formula for u n in terms of n is so unlikely that even your wildest guesses 
would never have produced it. 




If you substitute some values for n in this formula, and use a calculator, you will 
find that you do indeed get the right terms. 



6.A Patterns and formulas 



229 



(9) Description (A) for sequence (k) is u n = n 3 . 

(10) Description (A) for sequence (1) is u„ = n\ 

This means that u n _ x = (n - 1)! But n\ = n(n - 1)! so description (B) is 
u„ = nu n _i with U\ = \ . 

A formula which describes u n using the previous terms of the sequence, such as 
u n = u n - 1 + u n - 2 f° r the Fibonacci sequence, is called a recurrence relation or difference 
equation. Such equations have important applications in electrical engineering. 

6.B Arithmetic progressions (APs) 

6.B.(a) What are arithmetic progressions? 

The sequences (a), (b) and (c) in Section 6A.(a) are all examples of arithmetic progressions 
or APs for short. 

If you look back, you will see that in each case each new term is made by adding the 
same constant number to the previous term. 

We can write this type of sequence in the form 

a, a + d, a + 2d, a + 3d, . . . 

where a is the first term (so Wj = a) and d is what is called the common difference between 
each successive pair of terms. 

In (a), a = 1 and d = 1 . What are a and d in (b) and (c)? 



We would have a = 1 and d = 2 in (b), and a = 2 and d = 3 in (c). 



The nth term of an AP is given by u n = a + (n - \)d since we have only added d 
on (n - 1) times. 



^^^^ It's easy to think that the «th term will be a + nd but this is not so! 

" 

If the particular AP which we are considering only has n terms, so that u„ is the last term, 
we sometimes call this last term /, so then u„ = I = a + (n - X)d. 
Suppose we have the AP 1, 3, 5, 7, . . ., 33. 

(The dots in the middle signify that there are a whole lot of other terms here which we 
do not want to (or even in some cases cannot) list individually. This use of dots is a standard 
piece of mathematical language.) 

How many terms have we got here? 

Using u n = I = a + (n - \)d with a = 1 and d = 2 gives 

/ = 33 = 1 + (n - 1)2 = 1 + 2n - 2 so 2n = 34 and n = 17. 

(Equally, each individual jump is of size 2, and the total jump from 1 to 33 is 32. 
Therefore, we have 1 6 jumps and 1 7 terms. This is like fence-posts and the gaps between 
them; there is one more post than there are gaps.) 
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Try these two yourself. 

For each of the APs (1) 3, 7, 11, . . . , 79 and (2) 102, 100, 98, . . . , 14 write down the 
values of a and d. How many terms are there in each series? 



You should have these answers. 

For (1), a = 3 and d = A which gives 79 = u„ = / = 3 + (n - 1)4 = 3 + An - A so 80 = An 
and n = 20. 

For (2), a = 102 and d = -2. (The common difference here is negative.) 
We have u n = I = 14 = 102 + (« - 1) (-2) = 102 - In + 2 so 2n = 104 - 14 
and n = 45. 

6.B.(b) Finding a rule for summing APs 

For practical purposes, we often need the sum of some number of terms of an AP. 

When the terms are added together, we call the result a series. 

The process of actually adding the terms to find their sum is called summing the series. 

Is there any way in which we can do this without actually having to add on each term 
separately? 



There is a very neat way to do this. Think what happens if we turn the series the other 
way round and then add it to itself in the original order. The pairs of terms exactly slot into 
each other to give the same result, like two staircases fitted opposite ways round. 

Figure 6.B.1 shows the steps in adding the first eight terms of an AP as the sums build 
up term by term. 
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Figure 6.B.1 



Turn it upside down and you have the identical situation. 
To show how we can use this, we'll take the example of the series (1) which is 
3 + 7 + 11 + . . . + 75 + 79. 

We have just found that it has 20 terms, so we can write, using S for 'sum', 

S 20 = 3 + 7 + 11 + . . . + 75 + 79. 
Reversing the order, we can also write 

S 20 = 79 + 75 + 71 + . . . + 7 + 3. 
Adding these two sums, we get 

2S ?() = 82 + 82 + 82 + . . . + 82 + 82 
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and there are 20 lots of 82. Therefore 

5 20 = \ X 20 X 82 = 820. 

We can now see how this same system will work for a general AP with a first term of a, 
a common difference of d and a last term, u n , of /, by writing 

S n = a + (a + d) + (a + 2d) + . . . + (I - d) + I. 
Reversing the order, we can also write 

S„ = I + (I - d) + (I - 2d) + . . . + (a + d) + a. 
Adding, we get 

2S n = (a + I) + (a + I) + (a + I) + . . . + (a + I) + (a + I). 
There are n terms here, so we have 

2S n = n(a + /) or S n = \n (a + I). 
Also, since / = u n = a + (n - \)d, we can say 

S n = \n{a + a + (n - l)d) = \n (2a + (n - l)d). 



The rule for the sum of n terms of an AP is 
S„ = - [a + l) = - \2a + (n - \)d). 



6.B.(c) The arithmetic mean or 'average' 

We define the arithmetic mean, A, of two numbers, a and b, to be the number which makes 
a, A, and b form an AP. 

In other words, the arithmetic mean of a and b is the midway value between a and b, since 
an arithmetic progression is formed by taking equal steps between the terms. 

This means that ,4 = ^ (a + b). A is what people commonly mean when they talk about 
the 'average' of two numbers. 

This definition can also be generalised by defining the arithmetic mean of n numbers to be 

a\ + a 2 + a 3 + a 4 + . . . + a n 
n 

Again, this is what is commonly meant by the 'average' of these n numbers. 

6.B.(d) Solving a typical problem 

Here is an example of a typical problem on APs. 

The 7th term of an AP is 23, and the 4th term is 14. Find the sum of the first 20 terms. 

First, we must find a and d from the information that we have been given. The 7th term is 
a + 6d, and the 4th term is a + 3d, so we have 

a + 6d = 23 (1) 

a + 3d = 14 (2) 

Subtracting equation (2) from (1) gives 3d = 9 so d = 3. Therefore 

20 

a = 5, and S 20 = — (10 + 19 X 3) = 670. 
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6.B.(e) A summary of the results for APs 

Before asking you to try some similar questions yourself, I will group together all the 
formulas which we have found for APs. 

• We write APs as a, a + d, a + 2d, . . . , where d is called the common difference. 

• The «th term is given by u„ = a + (n - \)d. If this is also the last term, we call it /. 

• The sum of n terms is given by S„ = n/2 (a + /) where / is the last or nth term, 

or S n = \ [2a + (n - l)d]. 

a + b 

• The arithmetic mean of two numbers, a and b, is . 

2 

• The arithmetic mean of n numbers, a x , a 2 , a 3 , . . . , a„, is 

a \ + a 2 + a 3 + a 4 + ■ ■ ■ + a n 

n 



exercise 6.B.1 Try these questions yourself. 

(1) For each of the following APs: 

(i) write down the values of a and d, 

(ii) find the number of terms in the series, 

(iii) sum the series. 

(a) 2 + 9 + 16 + . . . + 107 

(b) 100 + 95 + 90 + . . . + 15 

(c) 6 + 6\ + 6\ + . . . + rj\ 

(2) (a) Find the sum of the natural numbers from 1 to 100 (that is, find 1 + 2 + 3 + 

. . . + 100). 

(b) Find the sum of the even numbers up to, and including 100, starting with 2. 

(c) Find the sum of the odd numbers up to 100, starting from 1. 

(d) Find the sum of the first n natural numbers. 

(3) The first term of an AP is 11 and the sum of the first 18 terms is 1269. What is 
the common difference? 

(4) How many terms must be taken in the series 7 + 11 + 15 + ... for the sum to 
be 1375? 

(5) An AP is such that the third term equals twice the first term. The sum of the 
first ten terms is 195. Find the first term and the common difference. 



6.C Geometric progressions (GPs) 

6.C.(a) What are geometric progressions? 

We move on now to consider sequences like those in (d), (f) and (g) in Section 6 A.(a). Each 
of these is an example of a sequence in which each new term is found by multiplying the 
previous term by a constant amount. This amount is called the common ratio. A sequence 
like this is called a geometric progression, or GP for short. 
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We can write this type of sequence as a, ar, ar 2 , ar 3 , . . ., ar" 1 where a is the first term, 
and r is the common ratio. 



The «th term is ar" l . 



(Notice that it isn't ar". Again, we are one behind ourselves.) r is called the common ratio 
because if we divide any term by the previous term, we get r as the answer. 



u n ar" ~ 1 
It is always true for a GP that = — = r. 

u n _ l ar"' 2 



In other words, the ratio between any pair of successive terms is 1 : r. 
It is often helpful to use this property in problems on GPs. 
Taking (d) as a numerical example, we have a = 1 and r = 2, and 

2 4 8 16 

— = — = — = — etc. = the common ratio, 2. 
12 4 8 

6.C.(b) Summing geometric progressions 

How can we find S„ = a + ar + ar 2 + ar 3 + . . . + ar" ~ l l 

It will be no good turning the sum the other way round this time, as the two sums will 
not slot together nicely as they did for the AP. 

However, if we multiply S„ by r, the whole sequence gets shifted along by one. We get 

rS n = ar + ar 2 + ar 3 + . . . + ar" (1) 
S„ = a + ar + ar 2 + . . . + ar n ~ 1 (2) 
Can you see what makes a good next step? 



Subtracting (2) from (1) makes nearly everything disappear, and neatly gives us 
rS„ -S„ = ar" - a. 
Factorising, we get S„(r - 1) = a(r" - 1), so 

a(r" - 1) 

S n = — . (Gl) 

r - 1 



Equally, by multiplying the top and bottom of the previous formula by -1, we can write 
this as 



a(\ - r") 

S n = — ; • (G2) 

1 - r 
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The working is easier if you use (G2) when r is between -1 and +1, and (Gl) 
otherwise. 

Here are some typical problems on GPs. (You might like to try having a go yourself first, 
before looking at how I have done them.) 

(1) Sum the following GPs. 

(a) 2 + 6 + 18 + ... for the first 20 terms. 

(b) 1 - 2 + 4 - 8 + 16 ... for (i) 10 terms, (ii) 11 terms. 



The solutions for this first question are as follows: 

(1) (a) We want 5 20 with a = 2 and r = 3. 
Using formula (Gl), we have 

2(3 20 - 1) 

S 20 = = 3 486 784 398. 

3 - 1 

(b) We want (i) S w , (ii) S n , with a = 1 and r = -2. 
Again using (Gl), we have 

(i) S l0 = -j— = -341 

l((-2) n - 1) 

(ii) S n = = 683. 

-2 -1 

It seems as if, for this series, not only are the terms alternating in sign, but also the 
sums, as we add on each new term. 

6.C.(c) The sum to infinity of a GP 

Suppose we have the GP 24 + 12 + 6 + 3 + . . . and we want to find (a) S 4 , (b) S w and 
(c) S 20 . 

We have a = 24 and r = 5. 

(a) The easiest way to find S 4 is simply to add the first four terms, which gives us 45. 

It is slightly more convenient to use formula (G2) for (b) and (c). 

(b) S w is given by 

24(1 - (5) 10 ) 
S i0 = \ = 47.953125. 



1-2 



(c) Similarly, 



24(1 - (5) 20 ) 
5 20 = \ = 47.99995422. 

1 " 2 

We notice here that the difference between the sum of the first four terms and the first ten 
terms is small. The difference between the sum of the first ten terms and the first twenty 
terms is very small indeed. 

We can see why this is so if we look at the sum of n terms. We have 

24(1 - (5)") ! 

S„ = -^— = 48(1 -(£)»)■ 

1 " 2 
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As n becomes larger and larger, (jT will become smaller and smaller. In fact, by taking a 
sufficiently large value of n, we can make the value of (jT become as close to zero as we 
please, although it will never equal zero. 



We can write this mathematically by saying lim (5)" = 0. 



This means that the limiting value of (\) n , as n tends to infinity, is zero. The symbol 00 
represents infinity, a boundlessly huge amount. 

Since (5)" — > 0 as n — > °o, we see that the sum to which the series is approaching, is 48. 

We call this the sum to infinity, and write it as S x . 

The same kind of thing will happen with any r which lies between -1 and +1. 

The example which we have just looked at could be demonstrated by what happens if you 
start with a piece of string 48 centimetres long and cut it in half. Lay down the stretched out 
left-hand piece, and halve the right-hand piece. Continue with this process, each time laying 
the new left-hand piece end to end with the previous pieces, and halving the right-hand 
piece. The lengths which you have joined end to end are the same as the numbers in the 
sequence, and your infinite process (mathematicians have no problem in halving infinitely 
tiny bits of string) brings you closer and closer to your original 48 centimetres of string. 

Another way of explaining what conditions r must fit in order for us to have a sum to 
infinity is to say that we must have \r \ < 1 where \r\ means the absolute value of r. This is 
the value of r taken as positive, whatever the value of r itself, so for example, \j\ = 5 
but I — 3 1 = 3. \r\ < 1 means the same as -1 < r < + 1. 



The sum to infinity of a GP 

fl (l - r") a 

If \r\ < 1 and S„ = then S. x = . 

1 — r 1 — r 



^^^^ This sum to infinity only exists if \r\ < 1, so that the values of r" actually 
^H-^V do become smaller, as n becomes larger. 

For example, if we have the sequence 2, 6, 18, 54, . . . so a = 2 and r = 3, and 
we say that 

2 

St* = 2 + 6 + 18 + 54 + . . . = = -1 

1 - 3 

it is clearly absolute nonsense. (It must be, because now r" is getting larger 
and larger.) 



6.C.(d) What do 'convergent' and 'divergent' mean? 

A series whose sum becomes closer and closer to a definite finite value, S x , as we take a 
larger and larger number of terms, is called convergent. 

For a convergent series, it must be possible to make the difference S„ - S x as small as we 
please, by taking a large enough value of n. 
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If a series is not convergent, then it is called divergent. 

An AP is always divergent. However tiny we make each individual step, we can always 
add together enough terms to get an absolute total which is larger than any number we are 
challenged with, because each step is equal in size. 

The different sums that we can find by taking different values of n are called partial 
sums. For example, if we have the series 1+2 + 4 + 8 + 16 + ..., then S x = 1, S 2 = 1 + 2 = 3, 
S 5 = 1 + 2 + 4 + 8 + 16 = 31 and each of these are partial sums. 

6.C.(e) More examples using GPs; chain letters 

The following three examples also use GPs. 

(1) How many terms of the GP 1 + 2 + 4 + 8 + . . . are required for the sum to be 
greater than one million? 

(2) The third term of a GP is 72, and the sixth term is 243. Find the first term. 

(3) The numbers n + 1, n + 5, and 2n + 4 are consecutive terms in a GP. (Consecutive 
terms are terms which come immediately after each other in order.) Find the 
possible values of n, and of the common ratio. Find also the values of the three 
given terms in each case. 

Have a go at these yourself before looking at what I have done. 



Here are my answers. 

(1) We have 1 + 2 + 4 + 8 +. . . 

Suppose we let n be the first number for which S„ > 1 000 000. 

1(2" - 1) 

a = 1 and r = 2 so S„ = = 2" - 1 . 

2-1 

2"- 1> 1000000 so 2" > 1000001. 
Taking logs to base 10 both sides, we have 
log 10 (2") >log 10 (1000001). 
Using the third law of logs from Section 3.C.(d), we have 

log 10 (1 000 001) 

«log 10 (2) > log 10 (1 000 001) so n > . 

log io (2) 

Therefore n > 19.93 to 2 d.p. 

The first whole number for which this is true is 20, so n = 20. 

This series appears in the story of the slave who was offered a reward by a 
grateful King. Spurning gold, he asked for wheat to be placed on a chess-board, 
with one grain for the first square, two for the second, and the number of grains 
doubled for each subsequent square. We have seen that there were already over a 
million grains by the 20th square. For the 64th square, he had 2 64 - 1 grains. This 
is a seriously large number. If each grain is j cm long, and they are placed end to 
end they stretch more than one million times round the equator. 

Chain letters do not work for the same reason. Suppose you receive a chain 
letter asking you to post £1 to the sender, and then send off two identical letters 
yourself. In theory, you end up £1 better off, but, in practice, this is exactly the 



6.C Geometric progressions 



237 



same situation as the grains of wheat. By the twentieth step in the chain, even with 
the number of letters only doubling each time, over a million people are involved, 
and clearly the system must break down. The more letters there are in each step of 
the chain, the sooner it breaks down. The only people who will safely make money 
are those near the beginning of the chain. For them, the larger the number of letters 
the better they do. The system is, in effect, a confidence trick. 

(2) The third term of the GP is 72 so ar 2 = 72. 

The sixth term is -243 so ar 5 = -243. Dividing, we get 
ar 5 243 
ar 2 72 ' 

Because GPs are formed by continued multiplication, dividing is often a technique 
which works well. 

Cancelling down gives us r 3 = -3.375. 

This can be solved on a calculator by finding the cube root of +3.375, by using 
the 'x Vy ' key. 

This gives 1.5, so the cube root of -3.375 is -1.5. 
Now, 72 = a(-1.5) 2 , so a = 32. 

(3) The ratio from dividing consecutive terms of a GP is constant, so 

n + 5 2n + 4 

= = the common ratio, r, of the series. 

n + 1 n + 5 

We have 

(n + 5)(h + 5) = (n + l)(2n + 4) 

so n 2 + lOn + 25 = In 2 + 6n + 4 

which gives 

n 2 -4n-2l = 0. 

Factorising this, we get 

(« - 7) (n + 3) = 0 so n = 7 or n = -3. 

Both of these answers are possible. 

We substitute back each in turn into (n + 5)1 (n + 1) to find the common 
ratio. 

12 3 

If n = 7, the common ratio is — = j, and the three terms are 8, 12 and 18. 
If n = -3, the common ratio is -| = -1, and the three terms are -2, 2 and -2. 



6.C.(f) A summary of the results for GPs 

• We write GPs as a, ar, ar 2 , . . . , where r is called the common ratio. 

• The nth term is ar" ~ 1 . 

• The sum of n terms is given by 



S„ = (best used if \r\ is greater than 1) (Gl) 

r - 1 

or 

a(l -r") 

S n = (best used if \r\ is less than 1). (G2) 

1 - r 



238 



Sequences and series 



• If |r| < 1, then 

S„ = — (G3) 
1 -r 

• \r\ < 1 means the same thing as -1 < r < +1. 

exercise 6.c.i This exercise introduces some very important ideas, so you should do it now as I 

shall use your answers straight away to show you how things work. Don't be 
tempted just to look at mine - thinking about your own answers makes an infinite 
difference to how much you learn. 

(1) Which of the following GPs are convergent? If they are convergent, find the 
sum to infinity in each case. 

(a) 12 + 18 + 27 + . . . (b) 18 + 12 + 8 + . . . 

(c) 64 - 48 + 36 - 27 + . . . (d) 16 - 40 + 100 - 250 + . . . 

(e) 1-1 + 1-1 + 1-1+.. . (f) + + 

(2) The sum of the first two terms of a GP is 30, and the sum of the second and 
third terms is 20. Find the first term and the common ratio. 

(3) The numbers n + 3, 3/7 - 3, and 5/7 + 3 are consecutive terms of a GP. Find the 
possible values of n and of the common ratio. Find also the values of the three 
given terms in each case. 

(4) (a) Which is the first term of the GP 3 + 12 + 48 + . . . to be greater than 

1 000 000? 

(b) How many terms of this GP are required in order to make a sum which is 
greater than 10 10 ? 



These are the answers which I hope you will have found. 

3 1 

(1) (a) r = 5 so \r\ > 1 and the series is not convergent. In fact, we can easily see that 
the sums will increase rapidly. 

(b) r = I so \r\ < 1 and the series is convergent. 

18 

$00 " t ~~ 54. 

i-l 

(c) r = - \ so \r\ = \ < 1 and the series is convergent. 

64 256 4 

Sao = T~~ = = 36^. 

!-(-!) 7 

(d) r = - § so \r\ > 1 and the series is not convergent. 

(e) r = - 1 so \r\ < 1. The symbol '<' means 'is not less than'. The series is not 
convergent. 

In fact, a very curious thing happens with (e). 

Normally, if we are adding a string of numbers, we can add them in any 
order that we please, so for example 

1 + 2 + 5 + 18 + 24 = (1 +2) + (5+ 18) + 24 = (1 + 2 + 5) + (18 + 24) etc. 

Here, if we put in brackets to group the terms, we get a very odd result. 
It would appear that it is possible to say 

S„ = (l-l) + (l-l) + (l-l) + ... = 0. 
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Also, it would seem reasonable to say 

S„ = 1 + (- 1 + 1) + (- 1 + 1) + (-1 + 1) + . . . = 1. 
Clearly, something is going wrong here. 

The fault in the argument is that, by taking the sum to infinity, we are 
implicitly assuming that the sum of this series is going to get closer and closer 
to a definite number the further we go. Here, this is not at all true. In fact, if 
we take an even number of terms the sum is zero, and if we take an odd number 
of terms the sum is 1, and there is a continual flip-flop between the two. The 
sum to infinity does not exist and the series is divergent. 

At the time when mathematicians were first working on the theory of 
infinite series, around the beginning of the nineteenth century, this kind of 
result caused considerable consternation, followed by a big jump forwards in 
understanding. It is often the cases which behave in peculiar ways which lead 
to advances in maths, because they make it necessary to look in more detail at 
what is actually going on. Situations like the one above make it evident that 
everything is not always as it seems, and that it can be dangerous to jump too 
soon to conclusions. 

It is true that we can group together the terms in any way we please in any 
finite sum of numbers. Also, if all the terms are positive, we can group the 
terms in any convenient way in an infinite series, because each next term is just 
another step up in the staircase. Putting some steps together into a larger step 
will make no difference to the total height of the staircase, whether this height 
is infinite or not. 

(f) Here, r = - j so |r| < 1 and the series is convergent. 



If we calculate some partial sums, that is, sums of different numbers of terms, 

2 

we find that they are alternately larger and smaller than j, but getting closer 
and closer to this value the more terms of the series we take. (Try this for 
yourself, using a calculator.) By taking a sufficiently large number of terms, 

2 

we can get as close to ? as we please. Furthermore, and importantly, any greater 

2 

number of terms will bring us even closer to 3. 

(2) Writing the given information mathematically, we have 

f a + ar = 30 (1) 

I ar + ar 2 = 20 (2) 

These equations can be solved rather neatly in the following way. Instead of 
writing equation (2) in the obvious factorisation of ar(\ + r) = 20, we write it as 
r(a + ar) = 20. We do this because the (a + ar) exactly matches up with the first 
equation. 

Now we can substitute in this new equation, using equation (1), and we get 

2 

30r = 20 so r = 3. Then, since a(l + r) = 30, a = 18. 

(3) The ratio of successive terms of a GP is the same, so 

3n - 3 5n + 3 

= = the common ratio. 

n + 3 3m - 3 
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So 

9n 2 - 18n + 9 = 5ra 2 + 1 8« + 9. 
4« 2 - 36« = 0 so, factorising, we have 

4n(n - 9) = 0 so n = 0 or 9. 
If n = 0, we get r = -1 and the three terms of the series are 3, -3, 3. 

24 

If « = 9, r=j2 = 2 and the three terms are 12, 24 and 48. 

(4) Here, a = 3 and r = 4. 

(a) Let « be the first number for which u„ is greater than 1 000 000. Then 

, 1000000 

u„ = 3(4)"-' > 1000000 so 4"- 1 > . 



© 



Taking logs, we have 

1000 000 

3 

Now, using the third law of logs, we get 



log 10 (4- 1 )>log 10 



(n - 1) log 10 (4) > log 10 



1 000 000 



3 

from which «-l>9.17to2 d.p. So the first possible integer value of n is 11. 
(b) Now let n be the first integer such that S„ > 10 10 . 



In the first part of this question, we are looking for the first term which is 
larger than some given value. In the second part, we are looking at the size 
of the sum of all the terms up to that point. Students quite often mix up 
these two different situations. 



We have 
3(4" - 1) 



> 10 10 so 4" > 10 10 + 1. 



4 - 1 

Taking logs, and using the third law, we have 
nlog 10 (4) > log 10 (10 10 + 1) 
so n > 16.6 to 1 d.p. The first possible integer value of n is 17. 

6.C.(g) Recurring decimals, and writing them as fractions 

We come next to some applications of GPs. 

The first of these gives us a way to convert some decimals to fractions. The strength of 
the decimal system for writing fractions is that it uses the same system of place values based 
on powers of 10 as our system of whole numbers uses. This means that decimal fractions are 
particularly easy to add and subtract and multiply, in just the same way that whole number 
calculations are straightforward with our number system. If you've ever tried adding or 
subtracting with Roman numerals, you will appreciate this. 



6.C Geometric progressions 



241 



Here are some examples of the place values. 

3 4 7 47 

0.3 means — , 0.47 means 



10 10 100 100 

10 8 108 
and 0.108 means — + + 



10 100 1000 1000 
(In general, we simply put a zero underneath for every digit on the top.) 



6 



Don't be tempted to say that g, for example, is 0.8! 

In fact, to write | as a decimal, we divide the bottom into the top and our 
number system automatically takes care of the rest so | = 0.125. 



A single-digit repeating decimal, like f = 0.333 ... is written as 0.3. 
In a similar way, = 0.090909 . . . = 0.09, where the line signifies that these two digits 
are repeated. 

Both of these examples are called recurring decimals, because the same group of digits 
is repeated infinitely. 

What happens if we want to convert a recurring decimal into fraction form? 
For example, suppose we have 0.17171717. .. or 0.17. 

It is no use trying to use our rule of zeros underneath for each digit, as this gives us a 
fraction with an infinitely long top and bottom. 

Instead, we use exactly the same device which we used to find the sum of a GP. In other 
words, we multiply by a number which slides everything along so that it exactly slots for a 
subtraction to work. Suppose we let 

F= 0.171717. . . 

Then 

100F= 17.171717. . . 
and, subtracting, we get 

17 

99F=17 so F = — . 

99 

You can check this result on your calculator, allowing for the fact that, as it gives a limited 
number of decimal places, it will round the last digit. 

The reason that the same technique works so well is that 0.171717 . . . is a GP. We can 
see this by writing it as 

0.17 = 0.17171717 . . . 

' ^ ( 17 ) + (tTtT ( 17 ) + (tTt) ( 17 ) + • • • 



100/ \100/ \ 100 



We have 



17 1 

a = and r = . 

100 100 
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\r\ < 1, so the sum to infinity of this series exists. 

= a = m 17 = ]7 

1 - r 1 - 100-1 99 

which agrees with our previous result. 

Here is another example. Find in fraction form 

12.4125125125. . . or 12.4T25. 

What do you think we should multiply by this time in order to slot everything into the 
optimum position? 



It will need to be 1000. (It is the number of digits which are repeated which is important 
here.) 

If we let F = 12.4125, then we have 
1000F= 12412.5125125 . . . 
F= 12.4125125... 

Subtracting, we have 999F = 12400.1, so 

12400.1 124001 

F= = 

999 9990 

multiplying top and bottom of this fraction by 10, to tidy it up. 

exercise 6.C2 Try converting the following decimals to fractions yourself. 

(l) 0.7 (2) 0.25 (3) 0.401 (4) 0.011 (5) 0.7 

(6) 0.29 (7) 2.534 (8) 40.2106 (9) 0.142857 

6.C.(h) Compound interest: a faster way of getting rich 

Another application of GPs is in calculating compound interest. If money is invested to 
obtain compound interest, this means that, in each successive period (usually a year or six 
months), you not only receive money on the original amount invested (the principal) but 
also on the accumulated interest so far obtained. 

With simple interest, on the other hand, you receive only the interest on the original 
capital or principal. 



example (1) James invests £800 at 5% compound interest per annum (year). How 
much money has he at the end of six years? 

Compare this with what he would have received if his money was 
invested at 5% per annum simple interest. 

We will look at how much he gets with simple interest first. 
At the end of the first year, he receives 5% extra, so he gets 
5 

X £800 = £40 extra. 

100 

Exactly the same thing happens in the other five years since he receives 
no extra interest on his accumulating interest. So at the end of six years 
he will have 

£800 + 6 X £40 = £1040. 
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Under the compound interest system, the result at the end of the first 
year is unchanged. Writing what happens in detail, we see that he has 

5 /105\ 

£800 + (£800) = (£800) = (1.05) (£800) = £840. 

100 \ 100/ 

Now the difference in the two systems starts to show because the 
interest for the second year is calculated from the total amount of 
money he now has. 

At the end of the second year, he has 

(1.05) (the amount now there) = ((1.05)(1.05)(£800)) = (1.05) 2 £800. 

So, at the end of six years, he has (1.05) 6 £800 = £1072.08 to the 
nearest penny. 

We see that he is £32.08 better off with the compound interest. 

When James is on a system of simple interest, the steps of his increases form an AP with 

V = 800 and '<f = 0.05 X 800 = 40. 

When he is on a system of compound interest, the steps of his increases form a GP with 

V = 800 and V = 1.05. 

How much money does James have in total after n years? 

If the money was invested at 5% simple interest, he will have n X (0.05 X £800) in 
accumulated interest, giving him a total of £800 + 0.05n(£800). 

If his money was invested at 5% compound interest, he would have (1.05)" £800 
altogether. 

Notice that these two formulas give us practical examples of working sequences. 
The sequence for his totals with simple interest over periods of a year, in £ units, is the 
AP which goes: 

800, 840, 880, 920, . . ., [800 + (n - 1) (0.05 X 800)], . . . 

The nth term of this AP is 800 + (n - 1) (0.05 X 800). 

This can also be written as a recurrence relation or difference equation, using the 
method of description (B) from Section 6.A.(b). We would write 

u n = w„_i + (0.05 X 800) = u n _ ! + 40 with u x = 800. 

The sequence for his totals with compound interest form the GP 

800, 840, 882, 926.10, . . ., (1.05)" 1 800, . . . 

with (1.05)"" 1 800 as its nth term. 

It can also be written as a difference equation in the form 

u n = (1.05)m„_ i with iii ~ 800. 

What if James invests the same amount each year with compound interest? 

Suppose that he was able to invest £800 at the beginning of each of the six years at the same 
rate of compound interest of 5%. How much would he have altogether on 2 January of the 
seventh year, when he has just deposited his most recent £800? 
He would have 

£800 + (1.05)£800 + (1.05) 2 £800 + . . . + (1.05) 6 £800 

which is a GP with a = £800, r = 1.05, and n = 7. So his total investment is 

800 ((1.05) 7 - 1) 

S 7 = = £6513.61. 

1.05 - 1 
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6.C.(i) The geometric mean 

We have already seen that the arithmetic mean, A, of two numbers, a and b, is defined as the 
number A such that a, A and b form an arithmetic progression. 

In a similar way, we define the geometric mean G, of two positive numbers a and b, to 
be the number such that a, G, b are in geometric progression. 

So a, G, b can also be written as a, ar, ar 2 giving G = ar and b = ar 2 . Now 

ab = a{ar 2 ) = a 2 r 2 = G 2 so G = -Jab. 

For example, suppose we have the pair of numbers 2 and 8. 

The arithmetic mean of these two numbers is the midway point of 5 (Section 6.B.(c)). 
This then gives a mini AP of 2, 5, 8 with a common difference of 3. 

The geometric mean of these two numbers is 4, given by ^2X8, resulting in the mini 
GP of 2, 4, 8 with common ratio 2. 

The definition of the geometric mean can also be extended to n numbers, provided that 
they are positive, in the following way. 

If the numbers are a x , a 2 , a 3 , . . ., a n then the geometric mean is n -\a x a 2 a 3 . . . a„. 



6.C.(j) Comparing arithmetic and geometric means 

We can also show that the arithmetic mean of any two positive numbers a and b is greater 
than their geometric mean. We have to show that 

a + b , 

> Jab- 

2 

This can be done rather neatly by putting a = x 2 and b =y 2 . Since we have said that a and b are 

positive, this is a safe move, and it gets rid of the y sign. We now have to show that 

x 2 + y 2 

> xy. 

2 

Can you see how the rest of the argument will go? 



We must show that x 2 + y 2 > 2xy. 

So we must show that x 2 + y 2 - 2xy > 0, that is, that (x - y) 2 > 0. But (x - y) 2 must be 
either positive (or zero, if x = y), since it is something squared. Therefore A > G. 



6.C.(k) 



What is the fate of the frog down the well? 



I will finish this section by asking you the following question. 

A frog is at the bottom of a well. He finds that he can jump up the side 
of the well, hanging on briefly between jumps. This procedure is exhausting 
so he jumps a shorter distance each time, starting with 1 m then \ m, \ m, 
and so on, so that the total height he has reached after n jumps is given by 



1 



1 



+ — metres. 
n 



1 1 1 

—+—+—+ 

2 3 4 

Obviously, if the well is only 2 metres deep, he will have escaped by his 
fourth jump. How deep must the well be for him never to escape, or will he 
always gain his freedom? 
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It is worth testing your ideas here numerically in any way you can. 

You could sum as many terms as you have the patience for on a calculator to get some 
idea of what is happening. 

Even better, if you can write computer programs, you could test any particular depth 
which you might think would definitely spell the frog's doom, by seeing if there is some 
number of jumps whose sum would actually come to more than this depth, so that he does 
escape. (I shall return to this puzzle later on in this chapter.) 

6.D A compact way of writing sums: the 2 notation 

6.D.(a) What does 2 stand for? 

We have looked fairly thoroughly at APs and GPs because they are relatively easy to sum, 
and also come up quite often in practical situations. Now we will widen the field by looking 
at some other kinds of series. 

To make this easier, I will show you a neat new method of writing the sum of a series. 
It is called the 2 notation, from the Greek capital letter S which is written 2, and pronounced 
' sigma'. 

Ill " 1 

To write 1 H 1 H . . . H — in this notation, we write A — • 

2 3 n r=\ r 

What we have done is to write down the sum using the general term of the series. The value 
of r at the bottom of the 2 gives the first term, and the value (of r) at the top of the 2 gives 
the last term. You can think of this 2 as meaning 'The sum of all such things as 1/r with r 
going from 1 to n\ 

#The letters used need not necessarily be r and n but the general idea will be 
the same. 



Here is another example, which uses n as the letter inside the 2. 

10 

X w = l+ 2 + 3 + ... + 10. 

n=\ 

The r in the first example and the n in the second example are dummy variables with the 
information about how far they run being written at the bottom and the top of the 2. Once 
this information has been filled in, the answer will be purely numerical, and it won't matter 
what letter we chose to use. 



exercise 6.D.1 Try writing the following in 2 notation for yourself. 

123 11 

(1) 1 + 4 + 9 + 16 + . . . + 81 (2) — + — + — + ... + — 

234 12 

111 1 

(3) + + + . . . + 

1X2 2X3 3x4 29x30 

(4) -1 + 4 - 9 + 16 - 25 + ... - 81 Be ingenious! 
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6.D.(b) Unpacking the 2s 

It will be quite useful for you to get some practice here in unpacking the 2 notation into the 
separate numerical terms, as sometimes it is necessary to convert back in this way. 
Here is an example of this. 

Find the sum of the first four terms, and also write down the nth term and the (n + l)th 
term, of the series 

i — 1 — . 

r= \ r(r + l)(2r + 1) 
The first four terms are 

1111 



1(2)(3) + 2(3)(5) + 3(4X7) + 4(5)(9) 
feeding in r = 1, 2, 3, 4 in turn. Tidying up, we get 

11 11 137 

6 30 84 180 630 
The nth term is 

1 



putting r = n. 



n(n + \){2n + 1) 
For the (n + l)th term, we put r = n + 1, and get 
1 



1 



(n + 1)(« + 2) (2(n + 1) + 1) (it + l)(n + 2)(2n + 3) 

Students sometimes find this last procedure a bit tricky, but it is well worth practising it now 
because you will need it if you have to work with more complicated series. 



EXERCISE 6.D.2 



For each of the following series, write down the first four terms, and then add 

them together. Also, write down the nth term and the (n + i)th term. 

n n n 

(i) I (2r +3 ) (2) I &(\Y-* (3)1 ~ 

r = i r! 



r = l 
n 



(4) £ 



r = i \r + 2 



r = l 
n 



(-i) r+5 



(5) I 



r = i (2r - l) (2r + l) 



6.D.(c) Summing by breaking down to simpler series 

Sometimes it is possible to sum series by breaking them down into simpler series which have 
known sums. I will give you some examples of this, using the following three standard sums. 



1+2 + 3 


+ 4 + 


n 

. . + n = 1 r 

r=\ 


= \n(n + 1) 


(SI) 


l 2 + 2 2 + 3 2 + 


4 2 + . 


n 

. + n 2 = X r 2 

r=\ 


= \n(n + 1)(2« H 


" 1) (S2) 


l 3 + 2 3 + 3 3 + 


4 3 + . 


. + n 3 = Z r 3 
r= 1 


= 5» 2 (« + l) 2 


(S3) 
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(If not knowing where these have come from worries you, we showed the first one when 
we did APs in question 2(d) of Exercise 6.B. 1 . The other two are shown to be true in the next 
chapter in Section 7.D.) 

Here is an example of how they can be used. 

n 

Find X (r + l)(r + 2). 
r= 1 

n n 

X (r + l)(r + 2) = X (r 2 + 3r + 2). 
r= 1 r= 1 

This can then be split into separate sums since it makes no difference what order we do the 
adding in. We say 

n n n n 

X (r 2 + 3r + 2)= X r 2 + X 3r + X 2. 
r = 1 r = 1 r=l r = 1 

Also, 

X 3r = 3 X r 

r=\ r = 1 

since multiplying each separate number by 3, and then adding, is the same as adding first 
and then multiplying the total by 3. 

You can see all this actually working if I put n = 3. 

3 3 3 3 

X (r 2 + 3r + 2)= X r 2 + 3 X r+ X 2. 

r = 1 r = 1 r=l r = 1 

The LHS of this is (l 2 + 3 + 2) + (2 2 + 6 + 2) + (3 2 + 9 + 2) = 38. 
The RHS of this is (l 2 + 2 2 + 3 2 ) + 3(1 + 2 + 3) + (2 + 2 + 2) = 38. 

A „,„. 

So we have 

X (r + l)(r + 2)= X r 2 + 3 X r+ X 2. 

r = 1 r = 1 r = 1 r = 1 

Using (SI) and (S2), we find this is the same as 

I n{n + l)(2n + 1) + 3 [jn(n + 1)] + In. 

(The 2 is now being added n times.) 

Factorising this by taking out g«, we get 

\n[(n + 1) (2n + 1) + 9 (n + 1) + 12]. 

(It is good to have the | out of the way in the front. If you are doubtful about what is inside 
the bracket, check by multiplying out.) Multiplying out the inside brackets, we have 

\n[{2n 2 + 3n + 1) + (9n + 9) + 12] = \n{2n z + \2n + 22) = \n{n 2 + 6n + 11) 

taking out an extra factor of 2, and cancelling. So 

n 

X (r+ \){r + 2) = \n(n 2 + 6n+ 11). 
r= 1 
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Check: If n = 3, we have just seen that 

3 

LHS = £ (r + l)(r + 2) = 38. 

r= 1 

Putting « = 3 in the answer gives 

RHS = \n(n z + 6n + 11) with « = 3, which is f (3)(9 + 18 + 11) = 38. 

exercise 6.D.3 Try these two yourself. Find 

n n 

(i) £ (r-i)(r+ 3 ) (2) £ r(r-i)(r+i). 

r = l r = l 

In each case, check your answers by putting n = 3. 



6.E Partial fractions 

6.E.(a) Introducing partial fractions for summing series 

In the earlier part of this chapter, we found out how to sum APs and GPs. Now we look at 
a rather ingenious technique which can be used for summing series involving fractions. 
(This particular technique also has many other uses.) 
Suppose we want to find 

i-L- 

r= 1 r(r + 1) 
that is, we want to find 

1111 11111 1 

+ + + + . . . + = — + — + — + — + ... + . 

1.2 2.3 3.4 4.5 n(n + 1) 2 6 12 20 n(n + 1) 

As it stands, there is no simple way of calculating this sum. 

However, the fraction r(r + fj looks as if it has come from putting two simpler fractions 
into one single fraction, as we did in Section l.C.(c). Suppose we try writing 

1 _ A B 

r(r +1) r r + 1 

where A and B are standing for numbers which we would need to find out. I've used the ' = ' 
sign here to emphasise that the two sides are just different ways of writing the same thing. 
What we have here is another example of an identity. I explained what this means in Section 
2.D.(h). 

To find A and B, we get rid of fractions by multiplying through by r(r + 1). 

Cancelling where possible, we get 1 = A(r + 1) + Br. 

Since this is just a rewriting, or identity, it must be true for all values of r. 

Putting r = 0, we get 1 = A. 

Putting r = - 1, we get 1 = - B, so B = -1. 

We can check by putting r = 1 , say. With these values of A and B, we get the LHS = 1 , 
and the RHS = 2-1 = 1 also. 

We now know that we can replace 

1 1 1 

by - . 

r(r +1) r r + 1 
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Will this help us? We can say 

n i n 

1 - 1 



= 1 



/ 1 


l 


\r 


r + I, 




n 


u 


- X 




r= 1 


1 


1 1 


— + 
2 


— + — 

3 4 


1 

+ — 


1 

+ — + 



= i i V - + - f - 4 . . . + - 



111 1 



2 3 4 n n+ 1 

and we see that it does indeed help us. 

The second bracket is almost exactly the same as the first bracket. It has the same number 
of terms, but everything has been slid one place to the right. 

When we do the subtraction, we are left with just 1 - l/(n + 1) so 

n ! , 

1 = 1 . 

r= 1 r(r + 1) n + 1 

You can check that this actually works by putting n = 2. This gives a LHS of 
2 + 6 and a RHS of 1 - 3, so the two sides do come out the same. 

What will happen as n becomes very large? Will this series have a sum to infinity? In 
other words, is it convergent? 



The larger n gets, the closer \l(n + 1) becomes to zero, so the sum of the series will get 
closer and closer to 1 . 

The series is convergent, with a sum to infinity of 1 . We can say 

00 1 

1 =1. 

r= \ r(r + 1) 

Now have a go at using the same method yourself to find the sum of the series 

2 2 2 2 2 "2 

- + - + — + — + ... + = Z • 

3 8 15 24 n(n + 2) r= l r(r + 2) 



Check how you got on. 

2 A B 

can be split up into two simpler fractions as — + . 

r(r + 2) r r + 2 

Then, multiplying by r(r + 2) to get rid of fractions, we have 

2 = A{r + 2) + Br. 

Putting r = -2 gives 2 = -2B, so B = -1. 
Putting r = 0 gives 2 = 2A, so A = 1. 

Checking, by putting r = 1, we have the LHS = 2 and the RHS = 3-1=2. 
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We can therefore say 
2 1 



r(r + 2) r r + T 
and we now have 

n 9 n i n 

1 1 - 

r=\ r(r + 2) r = 1 r r= 1 r 



111 1 

1 + — + — + — + ... + — 
2 3 4 n 



11 11 1 

- + — + ... + — + + 



3 4 n n + 1 n + 2 

(The last three terms in the second bracket come from putting r = n - 2, n - \, and n 
respectively.) 

This time, it is as though the right-hand bracket has been slid along two places instead 
of just one, as it was in the previous example. 

Subtracting all the overlapping parts, we are left with 

"2 I \\ I 1 1 \ 3 1 1 

X -= 1 



r - 



lr(r + 2) \ 2/ \n + 1 n + 2) 2 n + l n + 2 



1 1 

Both and will become very small as n becomes large. We can say that 

n + 1 n + 2 

1 1 

—> 0 and > 0 as n — > °° 



n+l n+2 

3 

so we see that the sum of the series is getting closer and closer to j- 

n o 

The series A is convergent, and its sum to infinity is 

r = 1 r(r + 2) 

3 

The number 5 forms a barrier beyond which the sum cannot go, however many extra 
terms we add although we can get as close to it as we please if we take a sufficiently large 
number of terms. (We never quite get there, though! We are always a tiny bit less than it since 
all the terms of the series are positive.) 

6.E.(b) General rules for using partial fractions 

When we summed the series 

"1 "2 

X and Z , 

r= \ r(r +1) r= 1 r(r + 2) 

we split up the complicated fraction into two simpler fractions, in each case. 

This technique of rewriting complicated fractions in the form of separate simpler 
fractions is called the method of partial fractions. It is often extremely useful, not only for 
summing series as we have already used it, but also in integration, as you will see in 
Section.9.B.(e). 

Because it is such an important technique, we shall look at it now in more detail. The two 
examples which we have already met both had two factors underneath. If the fraction has 
more factors underneath, it is simply split into more fractions. 
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So, for example, 

6 ABC 

is written as + + , 

(x - l)(x + l)(2x +1) x - 1 x + 1 2x + 1 

where A, B and C are standing for numbers which we have to find. 

Getting rid of fractions as before, by multiplying by (x - 1) (x + 1) (2x + 1) and cancelling 
where possible, we get 

6 = A(x + l)(2x + 1) + B(x - l)(2x + 1) + C(x - 1) (x + 1). 

Putting x = 1 gives 6 = 6A, so A = 1. 
Putting x = -1 gives 6 = IB, so B = 3. 

1 3 

Putting x = - 2 gives 6= - |C, so C = -8. 

Notice that we cunningly choose values of x so that two parts get knocked out each time, 
and we can easily find the value of the remaining letter. 

Then it is sensible to check the values we have found by putting x = 0, say, with these 
values, and making sure that the two sides balance. 

Here, the LHS = 6, and the RHS = J- 5- C= l- 3 + 8 = 6. 

Often, finding the partial fractions is only a small part of the complete problem, so it is 
wise to check that nothing has gone wrong at this stage. 

6.E.(c) The cover-up rule 

In a case like the above, it is also possible to find A, B and C by what is known as the cover- 
up rule. 

To do this, we choose each of the three values of x in turn which gives a zero in the 
denominator of 

6 

(x - l)(x + l)(2x + 1) 

(that is, we choose the same three values which we used in the previous working). 

Suppose we start with x = 1. Then we cover up the bracket (x - 1), and feed x = 1 into 
the rest of the fraction. 

This gives 6/6 = 1 as A, the number over (x - 1). 

Similarly, covering up (x + 1), and feeding in x = -1 to the rest of the fraction, gives 
5 = 6/2 = 3. 

Finally, covering up (2x +1), and feeding in x = -\ to the rest of the fraction, gives 
C = -8. 

You can use whichever method you prefer. 



exercise 6.E.1 Use whichever method you find most convenient to write the following as partial 

fractions. 

4 6 10 

to ~. r, r (2) ~. r. r (3) 



(x + 2)(x + 3) (2y-i)(2/+i) x(x-i)(x + 4) 



6.E.(d) Coping with possible complications 

Unfortunately, sometimes complications arise. These can be split into three types and I'll 
describe each of them in turn. 
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Repeated factors 

Suppose we have the fraction 

4 

(x + l)(x- l) 2 ' 
Can we say 

4 A B 

= + ? 

(x + l)(x - l) 2 x + 1 (x - l) 2 

We'll see what happens when we try to find^ and B. 

Getting rid of fractions, we have 4 = A(x - l) 2 + B(x + 1). 
Putting x = 1 gives 4 = IB so B = 2. 
Putting x = -1 gives 4 = 4A so A = I. 

Now check with x = 0. The LHS = 4 and the RHS = 1+2 = 3. 
Clearly, something has gone wrong! 

If we think what fractions we could have put together to give the original fraction then 
we see that there could have been a hidden one extra to the two which we wrote down above. 
Can you see what this extra one is? 



There could also have been the fraction 
C 
x- f 
If we now write 

4 ABC 

= + + 

(x + l)(x - l) 2 x + 1 (x - l) 2 x - 1 

and get rid of fractions by multiplying by (x + l)(x - l) 2 , cancelling where possible, we get 
4 = A(x - l) 2 + B(x + 1) + C(x - l)(x + 1). (1) 

^^^^ You need to think carefully here about the cancelling down. If you try to get 
^H-^V rid of the fractions on autopilot, you will almost certainly go wrong. 

Now, putting x = 1 we get 4 = 2B so B = 2 as before. 
Putting x = - 1 gives us 4 = 4A so A = 1, also as before. 

To find C, we can apply the very useful technique which we employed when we were 
factorising cubic equations in Section 2.E.(a). 
The way to do this is as follows. 

Since equation (1) above is an identity, the coefficients of each separate power of x on 
each side of it must match up. For example, there must be the same number of x 2 terms on 
each side; this is the only way that (1) can be true for all values of x. 

Looking at the terms in x 2 , we have 0 = ^x 2 + Cx 2 so C = -A so C = -1. 

Now we check again, putting x = 0. 
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This time, the LHS = 4 and the RHS = 1+2 + 1=4, which is a much better state of 
affairs. Our final result is 

4 12 1 

(x + l)(x -l) 2 ~ x + 1 (x - l) 2 x - f 



The rule for dealing with repeated factors 

If there is a repeated factor underneath, we must put in extra fractions to make up 
the whole power. For example, 

1 A B C D 

= + + + . 

(x + 1)(jc + 3) 3 x - 1 x + 3 (x + 3) 2 (x + 3) 3 



exercise 6.E.2 Try these two for yourself. Find partial fractions for 

5 2 

(l) , (2) . 

(x - 2)(x + 3) 2 y 2 (y - l) 



Non-linear factors 

3 3 

Suppose we have (1) and (2) 

(x + l)(x 2 - 4) (x + l)(x 2 + 4) 

How could we split up (1) to find its partial fractions? 



We could use the difference of two squares (again!) on x 2 - 4, and write 
3 3 ABC 

(x + l)(x 2 - 4) (x + l)(x - 2)(x + 2) x + 1 x - 2 x + 2 ' 
Finish this for yourself. You should get 

3 -1 - - 

(x + l)(x 2 -4) x+1 x-2 x + 2 

However, when we come to (2), we can't split up x 2 + 4 into two linear factors. (A linear 
factor is one like (x + 2) where, if we plotted y = x + 2, we would get a straight line.) 

Now, if we are dividing by x 2 + 4, the remainder can have xs in, as well as numbers, so 
we have to split (2) up into partial fractions as follows: 

3 A Bx + C 

(x + l)(x 2 + 4) ~ x + 1 x 2 + 4 ' 

Getting rid of fractions, 3 = A(x 2 + 4) + (Bx + C)(x + 1). 

3 

Putting x = -1 gives 3 = 5A, soA = $. 

Putting x = 0 gives us 3 = 4A + C, so C = 5. 

Matching the terms in x 2 gives us 0 = Ax 2 + Bx 2 , so B = -A = -j. 

Checking with x = 1 gives the LHS = 3, and the RHS = 3 + 0 = 3. 
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So 

- 3 , 3 3 

3 5 ("5* + 5 



) 3 



(x + l)(x 2 + 4) x + 1 x 2 + 4 5 \x + 1 x 2 + 4 

3 

taking out the factor of 5. Notice carefully the signs in the two forms of writing this answer. 
Remember that the line of the fraction acts as a bracket. (See, if necessary, Section l.C.(e) 
on subtracting fractions.) 



The rule for dealing with non-linear factors 

If one of the factors on the bottom of a fraction has an x 2 term, and this factor 
won't itself factorise any further, then we need both xs and numbers on the top, 
like the Bx + C above. 



Similarly, if we had a factor underneath with an x 3 term, and this factor wouldn't itself 
factorise, we would need to have Ax 2 + Bx + C on the top, and so on. 

exercise 6.E.3 Try finding partial fractions for 

14 4 

(1) , (2) . 

(x 2 + 3)(x+ 2) y(y 2 + 1) 



Top-heavy fractions 

Consider these four examples. 

x 2 + 3x - 5 x 2 + 4x - 2 x 2 + 1 x 3 + 3x 2 + 2x - 3 

(1) (2) — (3) — (4) 

x 2 + 2x - 8 x 2 + 5x + 6 x 2 - 9 (x + 2) (x - 1) 

Each of these fractions is top-heavy. By this I mean that the highest power of x on the top 
is greater than, or equal to, the highest power of x on the bottom. 

If we have this situation, it is necessary to divide before finding partial fractions for the 
rest of the expression. 

(This division is exactly the same process that we use in writing the fraction y as 2g. The 
arithmetical fraction y is top-heavy.) 

Fortunately, quite often this dividing can be done without using the full long-division 
process. 

(1) In this example, we can cunningly rewrite the top of the fraction as follows: 

x 2 + 3x - 5 x 2 + 2x - 8 + x + 3 



x 2 + 2x - 8 x 2 + 2x - 8 

This can then be written as 
x + 3 

1 + 



x 2 + 2x - 8 
Now we find partial fractions for 
x + 3 



x 2 + 2x - 8 
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This factorises to 
x + 3 
(x + 4)(x - 2) 
giving partial fractions of 

1 5 

x + 4 x - 2 

(Check this for yourself.) 

The complete solution is then given by 

x ~t~ 3x 5 z z 

= 1 + — — + — — . 

x 2 + 2x - 8 x + 4 x - 2 

^^^^ It's very easy to forget to include the 1 here. 

(2) Can you see how to rewrite the top of the fraction in example (2) to make the 
division easy? 



We can say 

x 2 + 4x - 2 x 2 + 5x + 6 - x - 8 

x 2 + 5x + 6 x 2 + 5x + 6 

This can then be written as 

x + 8 

1 ■ 

x 2 + 5x + 6 

Notice the signs again! The line of the fraction is acting as a bracket. 
Now, find partial fractions for 

x + 8 

x 2 + 5x + 6 ' 

You should have 

x + 8 x + 8 A B 

x 2 + 5x + 6 (x + 3)(x + 2) x + 3 x + 2 

so x + 8 =A(x + 2) + B(x + 3). 

Putting x = -2 gives 6 = B. 
Putting x = -3 gives us 5 = -A. 

Notice that, in this example, it is necessary to substitute for x on the LHS too. 
So the complete solution is 

x 2 + 4x - 2 / -5 6 \ 5 6 

= 1 - + = 1 + . 

x 2 + 5x + 6 \x + 3 x + 2/ x + 3 x + 2 
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There are two things to remember here: we must include the 1 like last time, and we also 
have to remember the minus sign in front of the big bracket. 

(3) Try doing this example for yourself. 



You should have 

x 2 +l x 2 -9+10 



.2 9 x 2_g 

10 10 

= 1 + = 1 + 



x 2 - 9 (x - 3)(x + 3) 

10/(x - 3)(x + 3) can then be easily split into partial fractions, giving a final complete 



answer of 

1 + 



5 5 
3 3 



x - 3 x + 3 

(4) Here, we shall have to have recourse to the full long-division process. I explained 
how to do this in Section 2.E.(b). We have 

x 3 + 3x 2 + 2x - 3 



x 2 + x - 2 
so we find 

x + 2 

x 2 + x - 2)x 3 + 3x 2 + 2x - 3 
x 3 + x 2 - 2x 

2x 2 + 4x - 3 
2x 2 + 2x - 4 
2x + 1 



Since x + x - 2 = (x + 2)(x - 1), we now have 

x 3 + 3x 2 + 2x - 3 2x + 1 

= x + 2 + 



(x + 2)(x - 1) (x + 2)(x - 1) 

You should check for yourself that this comes to 

x 3 + 3x 2 + 2x - 3 1 1 
= x + 2 + + 



x 2 + x- 2 x + 2 x - 1 

remembering to include the x + 2 in the final answer. 

The rule for dealing with top-heavy fractions 

If the fraction is top-heavy, that is, if the highest power of x on the top is greater 
than or equal to the highest power of x on the bottom, then we must divide out 
first, and find partial fractions for the remaining fraction. 
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We shan't need to use partial fractions which are as complicated as these for summing 
series, but you will need them for integration, and you are now set up for dealing with them 
when this happens. 



exercise 6.E.4 The following questions involve a mixture of the complications we have just been 

looking at. In each case, find suitable partial fractions. 

to ~. w ( 2 ) ~, v, ^ (3) 



(x + 3)(x - l) 2 (2p - + 2) 2 (2X + l) (x 2 - 6x + 9) 

10/ iox r 2 + i 

^ 7 ^71 ^ (5) 7 7 (6) ~1 

(y - i)(y 2 + 9) (x - i)(x 2 - 9) r 2 - 1 

x 4 + 1 u 2 - 1 x 2 + 1 

(7) (8) — r (9) 



X 4 - 1 U 2 (2U + l) (X + 2)(X + 4) 



2 



(10) (a) Write down the first four terms of the series X 

r = 1 4/-^ - 1 

2 

(b) Factorise 4r 2 - 1 and then use this to find partial fractions for 



n 2 V ~ 1 



(c) Now use these to find X 

r = 1 4r 2 - 1 

(d) What is the sum to infinity for this series? 

6.F The fate of the frog down the well 

In this last section, we return to the series + | + j + which describes the attempts 
of the frog to escape from the well in the thinking point of Section 6.C.(k). What I was really 
asking you there was whether this series is convergent or divergent. If it is divergent then, 
however deep the well, the frog will eventually escape. If it is convergent, then it must be 
possible to find a depth D so that anything deeper than this spells his doom. (D wouldn't 
necessarily have to be the sum to infinity of the series - this could well be tricky to find. It's 
like the headroom of a bridge: if a lorry crashes into it we know that anything higher than 
the lorry certainly won't get through, and we know this without having measured the exact 
headroom of the bridge.) Even if this series is convergent, there will be some depths which 
the frog can escape from, just like most cars can probably go safely under the bridge. 

We know that four jumps are sufficient to escape from a well which is 2 metres deep. 
Adding up the terms on a calculator, it is quite easy to discover that 3 1 jumps are sufficient 
if the well is 4 metres deep. We also know that each individual jump is getting smaller and 
smaller the more jumps the frog makes. 

Is knowing this sufficient for us to say that this series must converge towards some 
particular sum? (We know from Section 6.C.(c) that it would be enough in the case of a GP 
because, if the terms get smaller, then its common ratio must be less than 1 and therefore 
it will have a sum to infinity.) 

Might it help us here if we find the ratio of successive terms? We can see that, as n 
becomes large, there will be very little difference between \ln and \l(n + 1), although each 
of them separately is also becoming very tiny. We can say that 

u„ + j \l(n + 1) n 1 

u n \ln n + 1 1 + \ln 

(We did this same sort of thing when we were graph-sketching in Section 3.B.(i).) 
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Now, since - becomes closer and closer to zero the larger n becomes, this ratio gets closer 
and closer to 1 . This still leaves us in a bit of a quandary. The terms are getting more and 
more equal but they are also getting exceedingly tiny. Which will win? 

Mathematicians have actually shown that, if the terms of a series are positive, and if the 
ratio of successive terms gets closer and closer to some number less than 1 , then the series 
is convergent. If this ratio gets closer and closer to a number greater than 1 then the series 
is divergent. But if the ratio is equal to 1 , we need to do more investigation. 

Figure 6.F.1 gives a picture of what is happening as the number of jumps increases. I have 
laid them out sideways to fit them into the space better. The full height travelled is what we 
get if we place all these lines on top of each other, including the ones which will be too small 
to see, but which go on for ever. 

J >- The height of each jvmp in metres 




The number gives the particular Jui^tp in the je^enee. 

Figure 6.F.1 



There is a very neat way of showing what happens in the case of this series. It goes 
like this: 

Since all the terms are positive, we can reasonably group them in any way we please, 
because where we add bits on makes no difference to the total result. Every term you add 
on is moving you in the same positive direction, so each of these forward steps will have the 
same effect wherever it is placed. 

So we can say 

1111111 

1 + — + — + — + — + — + — + — + ... 
2 3 4 5 6 7 8 
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1 1 1 1\ 

— + — + — + —+ ... 
5 6 7 8/ 

1 1 1 1\ 

— + — + — + —+ ... 
8 8 8 8/ 



1 1 1 

that is, >1+- + - + - + ... 

2 2 2 

Clearly, this second series is divergent since we can make the sum as large as we like by 
taking enough terms. Therefore, the first series must also be divergent, and the frog does 
eventually escape. Actually, although mathematically his escape is assured, practically his 
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situation is not very rosy. After 1000 jumps he has still only gone about 7j metres. This series 
is very close to the convergence/divergence divide. Its true name is the harmonic series. 
Each term is related to a different mode of oscillation of a stretched string, with 1 
corresponding to the fundamental mode or first harmonic. Oscillation modes are important 
in all oscillating systems including the strings of musical instruments, which explains the 
use of the word 'harmonic'. 

In working out what happened in the case above we were able to compare the series we 
got by grouping the terms of the original series with the behaviour of a known series. Such 
comparisons make a very good method of attack on series which we can't easily sum, but 
we have to be very pernickety about when we can rearrange or regroup the terms of a 
series. 

We have already met the curious case of the flip-flop series in question (l)(e) of Exercise 
6.C.1 in Section 6.C.(f). 

This goes 1-1 + 1-1 + 1-1 + 1- .. . and its sum alternates between 0 and 1 depending 
on whether we've taken an odd or even number of terms. This series is divergent. It's 
important that 'divergent' doesn't necessarily mean that the sum gets larger and larger the 
more terms you take, though it does describe this possibility. 'Divergent' means any series 
which isn't convergent, and so doesn't have a sum to infinity. 

We can only rearrange or regroup the terms of an infinite series if they are all positive. 
(You can do what you like with a finite number of terms of any series - the order you add 
the terms in will make no difference to that particular total.) Once we start letting the series 
go on endlessly we find that the obvious is not always true. 

You might think that it would be safe to group the terms in brackets in a series where the 
individual terms are becoming smaller, and which is known to be convergent, even though 
these terms alternate in sign. 

The series 1 — 2 + 3 — 4 + 5 — 6 + - •• i s convergent. We'll find in Example (4) of Section 
8.G that its sum is equal to In 2. 

Now have a look at the following apparently plausible steps of working. 

1 0=1 I.I I.I I.I I 

in/ 1 2 i "3 4 i "5 6 + 7 8 • 1 • 

= 1 - 5- 5 + l- i- s + 5- i7)-n + -- - well > wh y not? 

= (1 - b - \ + (| - 1) - | + (5 - m> - • • • hmm . . . 
-I 1,1 I.I _L 

~2~4 + 6 - 8 + 10 - 12--- 

= \{\ - \ + \- \ + \- \ . . . = \\n2. a minefield! 

It is because of unexpected and curious results like this that mathematicians have had to 
investigate what actually happens so carefully. Since series are deeply involved in many 
practical applications, knowing what can and can't be done with them is very important. For 
these purposes, it may often only be necessary to consider what happens when you take a 
limited number of terms, but you need to know when it is safe to do this. It is the difference 
between taking a permitted liberty and sailing ahead without noticing the warning signs. 
Mathematically, as well as socially, this can lead to disaster. 
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Binomial series and proof by 
induction 



In this chapter we find out how to do binomial expansions, and see how they can 
describe some real-life situations. We also look at a new method of proving 
mathematical statements. 

The chapter is divided into the following sections. 

7.A Binomial series for positive whole numbers 

(a) Looking for the patterns, (b) Permutations or arrangements, 

(c) Combinations or selections, (d) How selections give binomial expansions, 

(e) Writing down rules for binomial expansions, 

(f) Linking Pascal's Triangle to selections, (g) Some more binomial examples 

7.B Some applications of binomial series and selections 

(a) Tossing coins and throwing dice, 

(b) What do the probabilities we have found mean? 

(c) When is a game fair? (Or are you fair game?) 

(d) Lotteries: winning the jackpot ... or not 

7.C Binomial expansions when n is not a positive whole number 

(a) Can we expand (1 + x) n if n is negative or a fraction? If so, when? 

(b) Working out some expansions, (c) Dealing with slightly different situations 

7.D Mathematical induction 

(a) Truth from patterns - or false mirages? 

(b) Proving the Binomial Theorem by induction, 

(c) Two non-series applications of induction 



7.A Binomial series for positive whole numbers 

7.A.(a) Looking for the patterns 

The first half of this chapter describes what are called binomial series. I have given them so 
much space because they have many applications. For this reason it is important that you 
should be able to do binomial expansions correctly and happily. The word 'binomial' comes 
from the two quantities put together in a bracket which we start from. Binomial expansions 
are what we get when we raise these brackets to different powers and then multiply the 
brackets together to find the result. In this first section all these powers will be positive 
whole numbers. 

Here are some examples. 

(a + b) 1 is just a + b 

(a + bf = {a + b)(a + b) = a 2 + lab + b 2 . 

The lab comes from the two middle terms of ab which add together because it doesn't 
matter what order we multiply a and b in. 
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Next comes 



(a + b) 3 = (a + b)(a + b)(a + b) = a 3 + 3a 2 b + 3ab 2 + b 3 . 

We find the answer by picking one letter from each bracket in every possible way and 
then multiplying these choices together. 

There is only one way of getting a 3 and b 3 . 

The a 2 b term comes in three ways, as we can choose the b from any of the three brackets, 
and then multiply it with the a terms in the other two brackets. Similarly, ab 2 can be made 
in three possible ways. 

What happens with 

(a + bf = (a + b)(a + b)(a + b)(a + b)l 

There will be just one a 4 and just one b 4 . There will also be some numbers of terms for each 
of a 3 b, a 2 b 2 and ab 3 . 

Because the a and the b are symmetrically placed in the brackets, there must be the same 
number of terms in a 3 b as there are in ab 3 . 

There will be four of each since we can pick either a single b or a single a in four different 
ways from the four brackets. 

The six possibilities for a 2 b 2 are given by aabb, abba, abab, baab, baba and bbaa. 

We see that by multiplying the four brackets together, we get 

(a + b) 4 = a 4 + 4a 3 b + 6a 2 b 2 + 4ab 3 + b 4 . 

Now we ask two questions. 

Firstly, is there an easier way than this of finding, for example, the 6a 2 b 2 term? 

Secondly, is there a general pattern building up from these results? 

If we write down how many we have of each possible combination of as and bs for all 
the brackets which we have multiplied out so far, we get the four lines of numbers written 
out below, which make a kind of blunt-topped triangle. 

1 1 
1 2 1 
13 3 1 
1 4 6 4 1 

These numbers give the coefficients for the different combinations of as and bs. 
Can you see what the next line of it will be? 



It is 

1 5 10 10 5 1 

Each number in each row is found by adding the two numbers nearest in the line above. 
If it is at the end of a row, the single number closest to it is used. 

We can use the row which we have just worked out to write down the expansion of 
(a + b) 5 . It is 

(a + b) 5 = a 5 + 5a 4 b + 10a 3 b 2 + 10a 2 Z> 3 + 5ab 4 + b 5 . 

This triangle, which gives the various different sets of binomial coefficients, is called 
Pascal's Triangle, after the French mathematician who first observed it, Blaise Pascal. 
Provided the power is not too high, it is the easiest way of working out what the coefficients 
will be. 
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EXERCISE 7.A.I 



Write down, by extending this triangle, the expansions of 
(1) (0 + b) 6 (2) (o + b) 7 



I've put the answers in straight away because they show something important. You should 
have 

(1) a 6 + 6a 5 b+ I5a 4 b 2 + 20a 3 b 3 + I5a 2 b 4 + 6ab 5 + b 6 

(2) a 1 + la 6 b + 2\a 5 b 2 + 35a 4 b 3 + 35a 3 b 4 + 2\a 2 b s + lab 6 + b 1 . 

Notice how the power of a moves down by 1 and the power of b up by 1 for each new 
term. The powers together add up to 6 for (1) and 7 for (2). 

We will now get some practice in the mechanics of binomial expansions in which the V 
and the '6' are replaced by more complicated expressions. (These often form part of the 
working of longer problems, and it is important that you should be able to do them 
confidently and accurately.) 

We'll work out (2x + 3yf as an example. 

Here, the 'a' is 2x, and the '6' is 3y, and n = 6. 

We get the binomial coefficients by using the sixth line of Pascal's Triangle. This is 

1 6 15 20 15 6 1. (P6) 

I've labelled it (P6) so I can easily refer back to it. 
The expansion goes 

(2x + 3yf = {2xf + 6(2x) 5 (3y) + 15(2x) 4 (3y) 2 

+ 20(2x) 3 (3y) 3 + 15(2x) 2 (3y) 4 + 6(2x)(3j) 5 + (3yf. 

Notice again the pattern of the powers. They move down by 1 each time for the V and up 
1 each time for the 'Z?' of the expansion. 

Added together, they always give n, the overall power we are calculating. 

Multiplying out, we have 

(2x + 3yf = 64x 6 + 576x 5 j + 2160x 4 j 2 + 4320x 3 y 3 + 4860x 2 y 4 + 2916xj 5 + 
729/. 

^^^^ Don't forget the part of each coefficient which comes from the 'a' and the 
^H-^V '/?' raised to the various different powers. Students very frequently make 

mistakes here. It is safer always to put brackets round the whole of the 'a' 

and the '6' as I have done above. 



exercise 7.A.2 Try expanding these for yourself. 

(i)(x- 2 y) 6 (2)(2x 2 -y 2 )5 ( 3 )^2x-^ 4 (4) ^ + 4* 2 

7.A.(b) Permutations or arrangements 

The pattern shown in Pascal's Triangle is very neat and, as we have seen, is very useful for 
writing down the answers for binomial expansions when the power is not too large. It would, 
however, be rather tedious to have to go much further than (P7) and we look now at how we 
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can find a general rule to give us these results. (This will also explain why we get this pattern 
in the first place.) 

To do this, we will look at the numbers of different possibilities of choosing some objects 
from a larger number of objects. We know that when we multiply out the brackets the order 
of the letters doesn't matter, so, for example, both aba and baa count as a 2 b. It's actually 
easier to find a general rule for what happens when the order of choice does matter, so we'll 
look at some examples of this first. 

Because it can make it easier to see what is happening if we look at it pictorially, and 
because the total number of choices quite quickly becomes amazingly large as we increase 
the possibilities, we will start with a relatively simple situation. 

Let's consider the number of possible choices of three counters from four differently 
shaped counters, and let's also suppose that the order of choice matters. Then the first 
counter can be chosen in four ways. The second one can be chosen in three ways from the 
three which are now left, and the third counter can then be chosen in two ways. This gives 
us a grand total of 4 X 3 X 2 = 24 choices. 

All the possibilities are shown in Figure 7.A.I. 



□ OA 


□AO 


OdA 


OAD 


AOD 


AdO 


□ Of? 


DEPO 


!?no 


I?OD 


OPD 


ODtP 


□ AC? 


DE?A 


(PDA 


PAD 


APD 


ADtP 


OAF 

Figure 7.A.1 


OtPA 


FOA 


t?AO 


AFO 


AOD 3 



Here is another example. 

Suppose there is a class of ten children and six of them will be given a prize. It is not 
allowed for any child to have more than one prize, and six different books have been bought 
for the purpose. We'll also suppose that these prizes are being handed out randomly - no 
awards for merit here! 

The child who gets the first book may be chosen in ten ways. For each of these ten 
choices, there are nine ways of choosing the child to get the second book. Then, for each of 
these choices, there are eight ways of choosing the third child. The total number of choices 
of the six fortunate children is given by 10 X 9 X 8 X 7 X 6 X 5 = 151 200 which is a 
surprisingly large number. The order of choice of the children matters because the books are 
all different so the same six children chosen in a different order will count as a different 
choice, since they would each then get different books. 

We can use the fact that the numbers are running down by 1 each time to write the total 
number of ways of distributing the prizes in a very neat compact form. We let the top run 
right down to 1 and then divide this by the extra part on the bottom (so that cancelling would 
bring us back to the original multiplication). 

We can then say that this total number is 

10X9X8X7X6X5X4X3X2X1 

10X9X8X7X6X5 = 

4X3X2X1 

_ 101 
4! ' 
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The symbol ! is used for multiplications like these. The 10! above is called 'ten factorial'. 
(Factorials came in also when we looked at series (1) in Section 6.A.(a).) 

The expression 10!/4! gives the number of permutations or arrangements of six objects 
(or people) chosen from ten objects (or people). 

We can see that it must be 4! on the bottom by noticing that 

4=10 (the total number we chose from) - 6 (the number of choices we are making). 

fFor permutations or arrangements, the order of choice matters. A different 
order gives a different arrangement. 



The number of permutations or arrangements of r objects from n objects is given by 
n\ 

(n - r)\ 



7.A.(c) Combinations or selections 

How much difference will it make if we have a situation in which we don't care what order 
the choices are made in? 

Returning first of all to the example of choosing three counters from four differently 
shaped counters, if the order of choice isn't important, how many different possibilities are 
there? 



There are only four. 

These are shown in Figure 7.A.2. (Any order would have done equally well.) 

□ OA DOC DAI? OAEP 

Figure 7.A.2 

If you now look back at the 24 possibilities shown in Figure 7.A. 1 . you will see that these 
are the four different possibilities shown in the left-hand column. Each row is then made up 
of the different arrangements of that particular choice of three counters, and there are six of 
each because each possible set of three counters was shown there in all its different orders. 
So there were three different choices for the first counter, two for the second and just one 
for the third giving 3 X 2 X 1 = 3! = 6 for each group of three counters. 

The total number of choices of three counters from four counters, if we don't care about 
the order of choice, is given by 

24 4! 

6 1!3!' 

We have to divide the total of 24 by 6 or 3! to get rid of all the different internal 
arrangements of each group of three counters, which we aren't interested in this time. 
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We can take a second example by looking again at the different ways in which the 
children can receive their prizes. 

Suppose this time that six identical copies of the same book had been bought for the 
prizes. The order of choice of the children no longer makes any difference because all six 
are getting the same book anyway. 

The number of different choices is now given by the number of different groups of six 
children. To find these, we no longer need to take account of the order in which any 
particular group was chosen. 

So we must divide our previous total of 10!/4! by 6! to get rid of all these unwanted 
internal different orderings. 

This gives us that the number of combinations or selections (that is, choices in which 
the order of choice doesn't matter) of six people from ten people, is 

10! 

6! 4! ' 

This is sometimes called 'ten pick six' or 'ten choose six'. 

I 

M ^ For combinations or selections, the order of the choices made does not 
V m matter. If the same objects are chosen, it makes no difference which one was 
chosen first, which second etc. 



The number of combinations or selections of r objects from n objects is given by 
n\ 

r\ (n - r)\ 



This is sometimes written as n C r or 




The number of ways of picking n objects from n objects if the order of 
choice doesn't matter, is just 1. Using the rule above, we would have 

n\ 

= 1. 

n\ 0! 



In order to make this rule work we say that 0! = 1. 



7.A.(d) How selections give binomial expansions 

We now link the work we have just done on selections back to what we saw was happening 
with binomial expansions. The procedure in these expansions is that we are choosing one of 
two possibilities from each bracket, then multiplying these choices together and finally 
grouping together all the similar results. 
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For example, we look again at finding (a + b) 4 = (a + b)(a + b)(a + b)(a + b). 
It's easy to see that all the as can be chosen in only one way, giving a 4 . 
Similarly, all the bs can be chosen in only one way, giving b 4 . 

Three as and one b can be chosen in four ways since the single b can be chosen from any 
of the four brackets and the other three will then necessarily be as. This gives us 4a 3 b. 
Similarly, three bs and an a can be chosen in four different ways, giving us 4ab 3 . 
Finally, in how many different ways can we choose two as? 

We are choosing two as from four as and the order of choice doesn't matter, so this can 
be done in 4!/2! 2! = 6 ways. 

We have found the 6 without using either Pascal's Triangle or having to draw the six 
possibilities. 

In exactly the same way, suppose we want to find the term in a 5 b 11 in the expansion of 
(a + bY 6 . 

The power here is of such a size that we wouldn't really want to have to extend Pascal's 
Triangle this far. (Besides, we only want one term.) 

We think of the term we want as giving the number of ways of choosing five as from 16 
as if the order of choice doesn't matter. 

16! 16 X 15 X 14 X 13 X 12 

This is given by = = 4368. 

5! 11! 5X4X3X2X1 

Since we must choose one letter from each bracket, choosing five as means that we must 
also have 1 1 bs so, equally, we could have said that this term would be given by the number 
of ways of choosing 11 bs from 16 bs. This is 

16! 

= 4368 as before. 

11! 5! 

In each case, once a certain number of one letter has been chosen, we know that the gaps must 
be filled by the other letter, so we don't have to worry about making choices for that. 

exercise 7.A.3 We have just found that the coefficient of the term in a 5 i>" in the expansion of 

(a + b) 16 is 161/5! 11! = 4368 so the term is 4368o 5 b". 

Find the coefficients of the following terms in the same expansion, giving your 
answers both in factorial form and as numbers. 

(1) 0 16 (2) a^b (3) a^b 2 (4) a 12 M (5) a s b 8 

(6) a*b 12 (7) a 2 b^ (8) b 16 (9) a r b l6 ~ r 

In each case, say also what the actual term would be. 

7.A.(e) Writing down rules for binomial expansions 

We can use the results which we have found in this exercise to write down the whole 
expansion of (a + b) 16 as follows: 

16.15 16! 

(a + b) 16 = a 16 + 16a 15 6 + a 14 b 2 + ...+ a 16 - r b r + . . . + b 16 . 

2! r!(16-r)! 

(The . . . stands for missing terms in the same way that we used it in Chapter 6.) 
We could also use the 2 notation which we met in Section 6.D, and write 

I 6 16! 

(a + b) 16 = I a l6 - r b r . 

r = 0 r!(16 - r)! 

Notice that we start with r = 0 so that we have a 16 and b° = 1 in the first term. 
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If n is a positive whole number, we can write down this rule for the binomial 
expansion of (a + b) n : 

n(n - 1) n(n — \){n — 2) 

(a + b)" = a" + na n - l b + a"- 2 b 2 + a"- 3 b 3 + . . . 

21 3! 

n\ 

a"- r b r + . . . + b". (Bl) 



rl(n - r)\ 



If you put n = 16, you will get the example of (a + b) 16 which we have just done. 

I have always found it best to remember the binomial expansion in the way in which I 
give it here, with the first three terms in their cancelled down form, because this is the 
easiest form to feed into, if you want to work out just the first few terms of a particular 
expansion. 

Have a go at one yourself, now. 

Try using the rule above to write down the expansion of (a + b) 5 . 
You will need to put n = 5. 



You should get: 

, U,5 5 c 4, 5(4) 3,2 5 < 4 X 3 ) 2,3 5 ( 4 X 3 X 2 ) ,4 , 5 

(a + b) 5 = a 5 + 5a 4 b + a 3 b 2 + a 2 b 3 + ab 4 + b 5 

2! 3! 4! 

so (a + b) 5 = a 5 + 5a 4 b + \0a 3 b 2 + I0a 2 b 3 + 5ab 4 + b 4 

which gives the same result as using Pascal's Triangle. 

In many circumstances, it happens that the first term in the bracket (which we called a 
above) is 1. 

Then, putting a = 1 and b = x to avoid confusion between the two forms, we get: 



n nifi — 1) 

(1 + x) n = 1 + — x + x 2 + . . . + : x r + . . . + x". (B2) 

1! 2! r\{n-r)\ 



I've included the 1! in the second term to keep the pattern of the factorials running 
through. We'll need this later on in Section 8.B.(a) when we take another look at e. 
Notice also that the second term has x and the third has x 2 , so 



n\ 

the term x r is actually the (r + l)th term. 

r\(n — r)\ 
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Similarly, in (Bl), the general term 



a" r b r is actually the (r + l)th term. 

r\ (n - r)\ 



When we wrote the series using 2 we made the sum run from zero to n, so there are n + 1 
terms altogether. 

Here is an example which uses the formula (Bl). 
Write down the first four terms of the expansion of (2x - jj) 12 - 
The value of n here is so large that it would be tedious to continue Pascal's Triangle as 
far down as we would need. 

Instead, we use form (Bl), putting V = 2x, 'b' = - \y and n = 12. 

^^^^ Remember that the minus sign must be included as part of '/?'. 

Substituting in these values, we have for the first four terms of the expansion 

„ , 12 X 11 , 12 X 11 X 10 . 

(2x) 12 + 12(2x) u (-\y) + (2x) 10 (-\yf + — — (2x) 9 {-\yf. 

2X1 3X2X1 

Tidying up these first four terms, we get 

4096x 12 - 12288x u y + 16896x 10 j 2 - 14080x 9 j 3 . 

exercise 7-A.4 Now try these for yourself. 

Write down and simplify the first four terms in the expansions of 
(i) (2x - y) 12 (2) (i - 2xY s ( 3 ) (i + x 2 ) 10 (4) £ x + 3 y) 16 



7.A.(f) Linking Pascal's Triangle to selections 

We are now in a position to be able to see comfortably how the links work between Pascal's 
Triangle and the selections which give the coefficients, using formula (B2). We use (B2) 
because it makes it a bit easier to see what is going on, but (Bl) would work in exactly the 
same way. 

We begin by writing down the eighth row of Pascal's Triangle, giving the coefficients in 
the expansion of (1 + x) 8 . I have labelled it (P8). It is: 

1 8 28 56 70 56 28 8 1 (P8) 

Try answering the following questions, and then we'll look at them together. 

(1) Use (P8) to write down the next row of the triangle, giving the coefficients for the 
expansion of (1 + x) 9 . Label it (P9). 

(2) Using (P8), write down the coefficients of (a) x 4 and (b) x 5 in the expansion of 
(1+x) 8 . 
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(3) In factorial form, the coefficient ofx 4 in the expansion of (1 + x) 8 is 8!/4! 4!. Write 
down the coefficient ofx 5 in factorial form. 

(4) Using (P9), write down the coefficient of x 5 in the expansion of (1 + x) 9 . 

(5) Now write down the coefficient of x 5 in this expansion in factorial form. 



Here are the answers. 

(1) 1 9 36 84 126 126 84 36 9 1. (P9) 

(2) The coefficient of x 4 in (P8) is 70. The coefficient of x 5 is 56. 

(3) The coefficient of x 5 from (1 + x) 8 in factorial form is 8!/5! 3!. 

(4) From (P9), the coefficient of x 5 in the expansion of (1 + x) 9 is 126. 

(5) The coefficient of x 5 in this expansion in factorial form is 9!/5! 4!. 

Now we try answering this question. 

We used 70 + 56 in (P8) to get 126 in (P9). 

Obviously this must also be true written in factorials, so 

8! 8! 9! 

1 must equal . 

4! 4! 5! 3! 5! 4! 

We now show that this must be true by factorising and tidying up the first two fractions. We 
have 

8! 8! 8! / 1 1 \ 

4! 4! 5! 3! 4! 3! \1 X 4 5X1/ 

(Check this step for yourself by multiplying it back. You'll need to use 4 X 3! =4! and 
5X4! = 5!) 

8! /5 +4\ 8! X 9 9! 

4!3! \4 X 5/ (4! X 5)(3! X 4) 5! 4! ' 

(This step involves adding fractions as we did in Section l.C.(c).) 

We can also see that this must happen if we think of (1 + x) 9 as coming from 
(1 + x) (1 + x) 8 . Then the term with x 5 in (1 + x) 9 comes from 
1 X the term in x 5 from (1 + x) 8 + x X the term in x 4 from (1 + x) 8 . 

exercise 7.A.5 With the above example to look back at, you should be able to answer the 

following three questions yourself. 

You first have to fill in the gaps marked with asterisks (*), and then combine 
the factorials. 

9! 

(1) The coefficient ofx 3 in the expansion of (1 + x) 9 is . (a) 

3! 6! 

*! 

The coefficient ofx 4 in the expansion of (1 + x) 9 is . (b) 

*! *! 

10! 

The coefficient ofx 4 in the expansion of (1 + x) 10 is . (c) 

*! *! 

Show, by factorising and tidying up, that (a) + (b) = (c). 
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*! 

(2) The coefficient of x 3 in the expansion of (1 + x) 12 is . (a) 

3! 9! 

*! 

The coefficient of x 4 in the expansion of (1 + x) 12 is . (b) 

*! *! 

*! 

The coefficient of x 4 in the expansion of (1 + x) 13 is . (c) 

*! *! 

Show, by factorising and tidying up, that (a) + (b) = (c). 

k\ 

(3) The coefficient of x r_1 in the expansion of (1 + x) k is . (a) 

(r- 1)! (k - r + 1)! 

*! 

The coefficient of x' in the expansion of (1 + x) k is . (b) 

*! *! 

*! 

The coefficient of x' in the expansion of (1 + x) k+1 is . (c) 

*! *! 

Show, by factorising and tidying up, that (a) + (b) = (c). 

7.A.(g) Some more binomial examples 

Here are three more examples showing ways in which we can pick out particular terms. 

example (1) Write down the term containing (&)p 6 , (b) q 6 , in the expansion of (p - 2q) 14 . 

To do this, we can use the expression for the general term in form 
(Bl). This is 

n\ 

a"- r b r . 



r\ (n - r)\ 

(Remember that this is the (r + l)th term of the series, not the rth term.) 

Here, n = 14, V = p, '6' = -2q and the term in p 6 is given when 
«-r = 6sor = 8. 

14! 

The term in p 6 is p 6 {-2qf = 768768d V- 

8! 6! 

14! 

The term in q 6 is p 8 {-2qf = \92\92p* q b . 

6! 8! 

Notice the symmetry of the binomial coefficients: 
14! 14! 



8! 6! 6! 8! 

/ 3\ 12 

example (2) Find the constant term in the expansion of Ax + — J . 

This is the one term in the expansion which is purely a number, and 
so doesn't depend upon the value of x for its size. It happens because 
the powers of x in this expansion are cancelling each other out to some 
extent on each term. 

Can you work out for yourself when it will be that they will cancel 
out exactly? 
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The term we want will involve (4x 2 ) 4 (-) 8 , so it is 

12! , 

(4x 2 ) 4 © 8 = 831409 920. 



8! 4! 



example (3) Find the term in x 11 in the expansion of (1 - x) 8 (3 + 2x) 5 . 

The complication here is that the term in x 1 1 arises from three 
different multiplications of pairs of terms, because x 1 1 can come from 
x 8 X x 3 and x 7 X x 4 and x 6 X x 5 . 

Any other combinations are impossible from this particular pair of 
brackets. 

We need to write down the terms of these separate multiplications 
fully in order to work out the complete term in x 1 1 . We get 



(-x) 8 



5! 



2! 3! 



(3) 2 (2x) 3 



8(-x) 7 



5! 



1! 4! 



(3)(2x) 4 



+ 



2! 6! 



(-x) 6 



(2x) 5 



Each separate part of the three terms we have added together here is 
enclosed in square brackets to make it easier for you to see how each 
bit has been worked out. 

Now, tidying up the above working, we get 

720X 11 - 1920X 11 + 896x" = -304x". 



EXERCISE 7.A.6 



Try these questions yourself. 

(1) Find the term in x 6 in the expansion of 

(a) (2-3X) 11 (b) (2x-y) 8 (c) (y 2 - 2X 2 ) 1 

(2) Find the constant terms in the expansions of 



3 \ 10 / 1 \ 9 / 1 



(a) 2x- 



(b) x + 



(c) 2X3 + 



X/ \ X'/ \ X/ 

(3) Find the term in x 10 in the expansion of (1 + x) 7 (2 - 3X) 5 . 



7.B Some applications of binomial series and selections 

7.B.(a) Tossing coins and throwing dice 

Binomial expansions can be applied very neatly to describe the likelihoods of the different 
possible outcomes to some events involving chance. When you do a binomial expansion, you 
are making a free choice of which of two terms to pick in each of the equal brackets, and 
then writing down all the different possible results. This fits any real-life situation in which 
there are repeated events, each of which has just two possible outcomes, and where the 
outcome of one event doesn't have any effect on subsequent events. 

For example, suppose you toss a fair coin. The likelihood or probability of getting a head 
is 2- ('Fair' here means that it is equally likely to fall heads or tails.) 

What will be the likelihood or probability of each of the different outcomes if you toss 
the coin three times instead? 



272 



Binomial series and induction 



We can show all these probabilities by writing the binomial expansion 

(jT+ \Hf = (\T) 3 + 3 (\Tf {\H) + 3 (jT)(\H) 2 + {\Hf. 

I have used H and T as markers for heads and tails, and the two halves in the first bracket 
stand for the probabilities of each of these on a single toss. Tidied up, we get 

\t 3 + \t 2 h + Ith 2 + Ih 3 . 

This carries all the information on the possible outcomes of the three trials, that is, 

• a probability of | of getting three tails, 

3 

• a probability of g of getting two tails and one head, 

3 

• a probability of g of getting one tail and two heads, 



a probability of g of getting three heads. 



This idea can be extended to situations where the outcomes on each trial aren't equally 
likely. Suppose you throw three dice and you want to know the probabilities of getting the 
different possible numbers of sixes. The probability of getting a six on a single throw of a 
fair die is one sixth because there are six possible equally likely outcomes, and only one of 
them gives a six. The probability of not throwing a six is g. If I use markers of P (for success 
in throwing a six) and Q (for throwing a different score) then I can show the probabilities 
for all the different outcomes by writing 

(ffi + IP? = (f 0 3 + 3(f 0 2 (zP) + 3(f 0 {\P) 2 + (IP) 3 



So 



125 ^ 75 ^ 15 „ 1 , 

= Q 3 + Q 2 P + QP + P 3 . 

216 216 216 216 

the probability of getting three sixes is 
the probability of getting two sixes is j^, 

75 

the probability of getting one six is ju>, 

125 

and the probability of getting no sixes is jTE- 

Notice that all the probabilities added together give 2lf = 1 • We are certain that the dice 
will fall in one of these ways. (This makes a useful check on the arithmetic.) 

I only listed the probabilities of the outcomes of three trials in each of my examples. It 
wouldn't be too hard to work these out by drawing tree diagrams or listing all the possible 
equally likely outcomes (remembering that, for example, you can get just one tail in three 
different ways because there are three coins). The strength of the binomial expansion is that 
it works equally well for some huge number of dice where it would be hideously tedious to 
write down all the possible outcomes. It would also work equally well in forecasting the 
likelihoods of the numbers of faulty items off a production line in batches of a given size, 
provided the probability of any one item being faulty remained constant. Once you 
understand the mathematical structure of a model, you can apply it in a vast range of 
situations which are similar mathematically, though physically they are very different. 

7.B.(b) What do the probabilities we have found mean? 

What does it actually mean when we say, for example, that the probability of getting two 
sixes if we throw two dice is 5^? 

It does not mean that if we throw two dice 36 times then there will be exactly one double 
six. We know from our own experience that this can't be so. What it does mean is that if we 
throw two dice a very large number of times then the proportion of double sixes will be 
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roughly 1 in 36. (It will get closer to 1 in 36 the larger the number of trials we make; yet 
another example of tending to a limit!) 

It is important that, in all these examples, what we have found are only theoretical 
probabilities which give us the likely ratio of the different outcomes in a very large number 
of trials. 

It is possible, for example, to get 12 heads in a row if you toss a coin, but both common 
sense and the theoretical probability of (|) 12 of this happening, tell you that it is very 
unlikely. You would begin to suspect that you might have a double-headed coin. 

Usually, the study of statistics tells us not whether something is possible or impossible, 
but how likely it is. Also, as we have just seen, these likelihoods can be found exactly. If the 
observed outcomes are, for example, much more frequent than their theoretical probability 
we are warned that further investigation is sensible. Perhaps all is not as it seems. 

These ideas are developed further in the study of statistics, in which such arguments 
(leading to tests of significance) can be made on a precise mathematical basis, rather than 
woolly feelings that something is wrong. These feelings may well be correct but a careful 
statistical test can make it possible to argue the case backed up by sound mathematical 
reasoning. 

7.B.(c) When is a game fair? (Or are you fair game?) 

This is a good point at which to introduce the idea of a 'fair' game. If a game is fair in the 
mathematical sense then it must be designed so that, over a very large number of goes, none 
of the contestants is expected to make a profit over the others. So, for example, if we toss 
a coin with you paying me £1 for a head, and me paying you £1 for a tail, then on average 
we will end up with neither of us gaining from the other. We have an equal probability of 
winning overall, even though, on three goes say, I may be lucky with three heads in a row. 
However, I can't play this game expecting to win money from you. 

But casinos and lotteries aren't fair in this sense. Clearly, they can't be, because they 
make profits for the people who run them. The probabilities are built in to be unequal from 
the start, and they are only fair in the sense that each contestant other than the banker or 
owner has an equal chance of winning on each attempt. 

7.B.(d) Lotteries: winning the jackpot ... or not 

Let's now consider one other practical application of these ideas before we go on to the next 
section. 

Suppose that the rules of a lottery say that in order to win the big prize or jackpot six 
numbers must be chosen correctly in the range from 1 to 49. 
What is the probability of actually doing this? 

There are 49 equal choices which can be made for the first number. Each number in the 
range can only be chosen once, so although the first choice is made from 49 numbers, the 
next is from the remaining 48, and so on. It is exactly the same kind of situation as when 
we were giving out the six identical prizes in Section 7.A.(c). The total number of choices 
is given by 

49! 

= 13983816. 

6!43! 

(We are using combinations here rather than permutations because the order of choice does 
not matter. For example, one person might choose 42 first, and another person, with the 
identical final choice of six numbers, might have had 42 as his second chosen number.) 
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So the probability of winning the jackpot in this lottery would be 1/13 983 816. In an 
astronomical number of tries, you could expect to win it roughly once in every fourteen 
million attempts. 

exercise 7.B.1 Try answering the following questions. 

(1) Choose six numbers in the range from 1 to 49 as randomly as you can without 
using any help like the random number generator on a calculator. Now repeat 
this nine more times. Use squared paper to show your choices on a grid which 
is 49 squares wide and 10 squares deep. Do you think your choices look really 
random? Feel free to alter them if you want to. 

(2) In a lottery like the one described in the previous section, which of these 
three choices of six numbers would be most likely to win you the jackpot? 

(a) 1, 2, 3, 4, 5, 6 (b) 2, 14, 2i, 29, 33, 45 (c) 44, 45, 46, 47, 48, 49 

(3) Would there be any good reason for picking one group rather than the other two? 

(4) What would be the probability of guessing at least one number correctly in a 
lottery like this? Write down what you think it might be, and then work out 
how near your estimate is to the true answer. Hint: work out how many ways 
there are of choosing all six numbers completely wrongly. 

7.C Binomial expansions when n is not a positive whole number 

7.C.(a) Can we expand (1 + x) n if n is negative or a fraction? If so, when? 

All the arguments we have used to justify the binomial series have depended on having a 
factor multiplied by itself a whole number of times. 

It would be interesting and useful if we could extend this. Can we make any sense of 
something like an expansion of (1 + x) -1 , for example? 

We certainly can't give it the same kind of meaning which we could when we had a 
positive whole number power; then, we could actually lay out the brackets to make our 
choices. However, we'll persevere and see what would happen in an experimental kind of 
way, taking the particular case of (1 + x) -1 . 

We know that we can certainly write (1 + x) -1 as 1/(1 + x). Now let's see what happens 
if we try using the (B2) expansion from Section 7.A.(e) on (1 + x) _1 , putting the n of this 
formula equal to -1. We shall get 

„ w , ("I) (-1X-2) 2 (-1X-2X-3) 3 (-l)(-2)(-3)(-4) 

(1 + x) _1 = 1 + x + x 2 + x 3 + x 4 + . . . 

1 2X1 3X2X1 4X3X2X1 

The first thing that we notice is that the countdown on the top of the fractions isn't going 
to come to a natural end like it does when n is a positive whole number. 

(1 + x)~ l is giving us an infinite series. We've seen examples in Chapter 6 of the dangers 
connected with summing infinite series. 

Try tidying up this one yourself and see if you recognise what you get. Then you should 
be able to say whether this expansion works. If so, will this depend in any way on what value 
x has? 



Tidying up what we have above for the expansion of (1 + x) l , we get: 
(1 +x)~ 1 =— — = l-x + x 2 -x 3 +x 4 -... 

1 + X 

This is a GP with V = 1 and V = - x, and 1/(1 + x) is its sum to infinity. 
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So far, so good, but we know from Section 6.C.(c) that a GP only has a sum to infinity 
if its common ratio lies between -1 and +1. So we can say that, in this particular case, the 
expansion does work provided |-x| < 1. Now |-x| is the same as |x| since we are taking the 
positive value whatever the sign. So we must have |x| < 1, or -1 < x < 1, writing it another 
way. 

You can see for yourself that we will be in trouble if we don't stick to this. For example, 
suppose x = 2. This would give us 

1 

=1-2 + 4- 8 + ... 

1 + 2 

The problem here is that successive terms are getting bigger. These terms alternate in sign 
and so do the partial sums obtained by adding in each new term. Each of these is larger than 
the previous one in absolute size, so this series can't be getting closer and closer to | as we 
add more and more terms. 



It has been shown by mathematicians that (1 + can be expanded using (B2) if 
n is either negative or a fraction or both, provided that the 4 fits the requirement 
that |*| < 1. 

(4 stands for whatever we have in this position in the bracket.) 



7.C.(b) Working out some expansions 

Now we'll practise the mechanics of how these expansions go, because this process is just 
an extension of what we have been doing with binomial expansions for positive whole 
number powers, and it will be useful for you later on to be able to do this. 

Here are two examples of such expansions. 

Expand as far as the term in x 3 , stating the restrictions on the value of x in each case: 
(1) (1 + 3x)~ 2 (2) (1 - x/2) 1/2 

For (1), n = -2 and * = 3x. We must have |4| < 1, so we want |3x| < 1, which means 
-1 < 3x < 1, so -3 < x < 3. 

In order for the expansion to be possible, x must lie somewhere in this interval. 
If x does fit this requirement, we can say: 

(-2V-3) (-2)(-3)(-4) 
(1 + 3x)- 2 = 1 + (-2)(3x) + K —^- (3x) 2 + VV^V 0*? + ' • " 

2X1 3X2X1 

= 1 - 6x + 27x 2 - 108x 3 as far as the fourth term. 

For (2), n = \ and 4 = -x/2, so we want |-x/2| < 1. But |-x/2| = |x/2|, since we are taking 
the positive value whatever the sign. 

So we must have -1 < x/2 < 1 which means -2 < x < 2. 
Provided x fits this requirement, we can write: 



As 



A (b(-b ( A 2 (b(-b(-b 



2) 2X1 2 3X2X1 



g ^.2 3 



= 1 - — - — as far as the fourth term. 

4 32 128 
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Now, in each of the above cases, substitute x = 0.001 and see how closely the two sides 
match up, as you add in the extra terms on the RHS. You will find that, because x is small, 
you very quickly get close to the LHS, and indeed are beginning to find an answer accurate 
to more decimal places than your calculator is giving you, in the second case. This 
possibility of being able to replace an infinite series by a fast numerical equivalent to any 
desired degree of accuracy is often important in practical applications. 



exercise 7.C.1 Try expanding the following three examples yourself, as far as the term in x 3 , 

stating in each case the restrictions on x for the expansion to be valid. 

(1) (1 + 2x)-3 ( 2 ) (1 - 3*r (3) a + !*r 2 



7.C.(c) Dealing with slightly different situations 

What should we do if we want to find the expansion of (2 + 3x)~ 2 ? We can't any longer use 
the (B2) formula to expand this. 

I think that in such a case the simplest method is to rearrange the bracket so that it is in 
(1 + 4) form. Doing this simplifies the arithmetic quite a bit, as it avoids complicated and 
changing powers of V. 

So we write: 



(2 + 3jc)~ 



2 1 



3x 
2 



= 0-2 



3x 



1 



3x\- 2 
2 



^T^L It is important that the factor which we take out of the bracket was part of this 
^H-^V bracket, and so it is raised to the same power as the bracket itself. 

^^^^ Remember, too, that if you are taking out a factor, it applies to the whole 
^H-^V bracket, so we must write 3x/2, and not leave the 3x unchanged. 

For the expansion to be possible, what interval must x lie in? 



We must have 
3x 



< 1 



so 



3x 

1 < — < 1 
2 



so 



2 < 3x < 2 giving 



2 2 

- < x < -. 

3 3 



Expanding, using (B2), we get that 



3x 
1+ T I 



3x 

l + (-2)| y 



2X1 \ 2 / 
1 3x 27x 2 27x 3 



(-2)(-3) /3x\ 2 (-2)(-3)(-4) /3x 



3X2X1 



16 



This step needs to be done quite carefully if you are not to lose any bits! Try doing it yourself 

3 

as a check. Remember to square and cube the j when necessary. 



7.C When n is not a positive whole number 



277 



Here is another situation which you may meet. 
Suppose you need to find the expansion of 

1 

y (2-x)(l+2x) 

up to the term in x 3 , also finding the interval in which x must lie for the expansion to be 
valid. 

There are two ways of doing this. 

Method (i) We write 

y = (2 - x)- 1 (1 + 2X)- 1 = 2- 1 (1 - x/2)- 1 (1 + 2 X y l 
1 



2 



1 

2 



y.2 3 



[1 - 2x + 4x z - 8x J . . .] 



using the rules I gave at the end of the answer to question (2) of 
Exercise 7.C.1 to speed up the working inside these two brackets. 

Now we do the multiplying. This is not as bad as it might at first 
sight seem since we only want terms up to x 3 . 

I shall multiply the second bracket by each of the terms of the first 
bracket, only writing down the terms I need. This gives me 



2 [1 - 2x + 4x 2 - 8x 

) x x 2x 



I 2 _i_ n 3 

^X X ~r Z.X 

_l I 2 I 3 
A % 2 



+ |x 3 ] 



In 2 , 13 2 51 3-1 
2 L 2 X 4 X g X J 

1 3 13 2 51 3 

2 ~~ 4 X + 8 x ~ 16 x • 



Method (2) This avoids the multiplication by finding partial fractions for y. (Partial 
fractions are explained in Section 6.E.) Doing this gives 



I 2 

5 5 

y = " — 



■ x 1 + 2x 

= 1(2- xy 1 +|(1+ 2X)- 1 = fl - H +|(1+ 2x)- 



2 3 

X X X 



1 + - + — + — + . . . + j (1 - 2x + 4x 2 - 8x 3 + . . .) 
2 4 8 

3 13 2 51 3 

4 x + g x - 16 x 



writing down the first four terms. 



Finally, whichever method we used we must find the interval in which x must lie for the 
expansion to be valid. Both methods involved the same two expansions, so we look at each 
of these in turn. 

For the first, we want |-x/2| < 1 so |x/2| < 1 and - 2 < x < 2. 
For the second, we must have |2x| < 1 so - j < x < 5. 
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So, to fit both requirements, we must take the tighter of the two restrictions, so -5 < x < 5. 
This is the same situation as a lorry driving down a road which successfully makes it under 
the first bridge, but the headroom of the second bridge is lower. Disaster will strike unless 
the lorry is also lower than this second bridge. 

exercise 7.C.2 Try these for yourself. 

(1) Expand each of the following as far as the term in x 3 . In each case, find the 
interval in which x must lie for the expansion to be valid. 

(a) (1 - 3*) 1/3 (b) (1 + (c) (16 - 3*) lM 

(d) (4 + x)" l/2 (e) (- 2 + x)- 2 (f) (27 - 4*)~ 2/3 

You may need to look back at the rules for powers in Section i.D.(b) for help 
with the tidying up. 

(2) Expand (3 - 2x) _1 (1 + 3X)" 1 as far as the term in x 2 , and find the interval in 
which x must lie for this expansion to be valid. 

7.D Mathematical induction 

7.D.(a) Truth from patterns - or false mirages? 

If we find a particular pattern, how can we discover if this pattern will always be true or if 
it was just a lucky chance that it was true for the cases which we looked at? 

To answer this question, we will start by looking at the following pair of series. 

n 

(a) 1+2 + 3 + 4 + .. . + n= Xr 

r= 1 

n 

(b) l 3 + 2 3 + 3 3 + 4 3 + . . . + n 3 = X r 3 

r=\ 

An interesting thing happens if we compare the two sets of partial sums of these series, as 
they build up. 

If we take n = 1 , so we are just comparing the first terms, we get 

51 for (a) =1 and S x for (b) = 1. 
Summing the first two terms of each series, we get 

5 2 for (a) = 3 and S 2 for (b) = 9. 

Find S 3 and S 4 for each series yourself and see if you can suggest an experimental pattern 
for what is happening. 



You will have 

5 3 for (a) is 6, S 3 for (b) is 36, 

5 4 for (a) is 10, S 4 for (b) is 100. 

It rather looks as though, if we square the sum of (a) for any given number of terms, we get 
the corresponding sum for (b), for that number of terms. 
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In other words, it looks as though, if n is any number of terms we might choose to pick, 
then 

(S„ for (a)) 2 = S„ for (b). 

(Because n is counting the number of terms, it must be a positive whole number or natural 
number as these counting numbers are sometimes called.) 

Now, we have already found a formula for the sum of n terms of series (a) in Exercise 
6.B.I. 2(d). We found 

n(n + 1) 
S„ = - 
2 

Is it true that S„ for (b) is n 2 (n + l) 2 /4 whatever n is? 

We shall prove that this is true by using the following process. 



Mathematical induction: how to do it 

(1) We first show that a statement is true for the case in which n = 1. 

(2) We then show that 

if the statement is true when n is given some particular value, k, 
then it must also be true if n = k + 1 . 

We can then argue that, since we know it is true when n = 1, it must also be true 
for n = 2, and therefore also for n = 3, etc. through all the counting numbers. 



We have already done step (1) for this first example. 

Now we go to step (2). 

We will suppose that the statement 

" 3 n 2 (n + l) 2 



1 r- = 

r=\ 4 

is true when n is given the particular value, k, so that 

k 2 (k + l) 2 

l 3 + 2 3 + 3 3 + 4 3 + . . . + k 3 = (This is St[*].) 

4 

The St[k] at the right-hand edge of the line above is a convenient shorthand meaning 'the 
statement of the formula when n = k\ 

We then show, that if St[£] is true, then the formula is also true when n = k + 1 , so that 
St[k+ 1] is true. 

Here, we must show that if St[k] is true then 

(k + l) 2 (* + 2) 2 

l 3 + 2 3 + 3 3 + 4 3 + . . . + k 3 + (k+ 1) 3 = . (ThisisSt[£+ 1].) 

4 

We have added in the extra term on the left-hand side, and replaced k by k + 1 in the formula 
on the right-hand side. 

The LHS of St[k + 1] can be written as 

k 2 (k + l) 2 

(l 3 + 2 3 + 3 3 + 4 3 + . . . + k 3 ) + (k + l) 3 = + (k + l) 3 

4 

using St[k] to replace l 3 + 2 3 + 3 3 + . . . + k 3 with k 2 {k + \) 2 IA. 
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Now we factorise this, by taking out the common factor of (k + l) 2 . 
It will also pay us here to take out a factor of 4, as it is more convenient to have the 
fractions at the front, out of the way. 



Always look for factorisations at this stage. Multiplying out the brackets is 
long and tedious, and liable to bring in mistakes. You want to make the 
statement as simple as possible. 



This then gives 

k2 ( k + !) 2 ,1 , , 

+ (k+ l) 3 = \{k + l) 2 (k 2 + 4(k + 1)). 

4 

Notice the 4 inside the bracket, to make it multiply out correctly to give what we started 
with. But 

\(k + l) 2 (k 2 + 4k + 4) = \ (k + \ f{k + 2) 2 
so now we have 

l 3 + 2 3 + 3 3 + . . . + (k + l) 3 = \{k + l) 2 (k + 2) 2 = RHS of St[k + 1]. 

Therefore we have shown that ifSt[k] is true, then St[& + 1] is true. 

But we know that St[l] is true, so St[2] is true, and so on, for n = 3, 4, . . . through all 
the counting numbers. 

^^^^ It is important to be very careful about the 'if . . . 'then'' aspect of this 
^H-^V argument. 
* When using this method of proof we must always show that if the 

statement we are considering is true for n = a particular value, k, then it 
must also be true for n = k + 1 . 

(We can't give an actual numerical value of k since then a person could 
say 'Oh well, maybe it is true for that value, and the next one, but I don't 
see that that makes it true for any pair of values.' And they would be right.) 



Here is a second example of proof by induction. Prove that 

n 

X r 2 = \n{n + 1) (2n + 1). 

r = 1 

You may notice a rather serious disadvantage here! The method of mathematical induction 
is only going to be any use when we have somehow come to what the formula might be by 
some other route. It won't find an appropriate formula for us. 

Working with the formula we have been given here, we first check that it works for n = 1 , 
that is, that St[l] is true. 

Always start with this; if it is not true there is no point in proceeding any further, and if 
it is true, showing this is part of the chain of the proof. 

If n = 1, the LHS = 1, and the RHS = |(1)(2)(3) = 1 so it is true in this case. 
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Next, we suppose that the formula is true for n = a particular value, k, that is, we 
suppose 



St[Jt] 



> 2 + y + . . . + k 2 = iKk + \)(2k + 1) 

We then have to show that this would mean that the formula is also true for n = k + 1 , that 
is, we must show that, if St[£] is true, then 



k 2 + (k + l) 2 = L b (k + \)(k + 2)(2k + 3) 



St[* + 1] 



adding in the extra term on the LHS, and replacing k by k + 1 on the RHS. 
The LHS of St[k + 1] can then be rewritten as 



- b k(k + 1)(2£ + 1) + (k + l) 2 using St[k] to replace l 2 
Factorising in a similar way to the last example, we have 

\kik + 1) (2k +!) + (*+ I) 2 = \(k + 1) (k(2k + 1) + 6(k + 1)). 



2 2 + 3 2 + . . . + k 2 . 



If you are at all doubtful about your factorising at this stage, check by 
multiplying back that it agrees with the previous step. 



Tidying up, we get 

\(k + 1) (k(2k + 1) + 6(k + 1)) = \(k + \){2k 2 + lk+6) 

= \{k + l)(k + 2)(2k + 3) = RHS of St[k + 1]. 

Therefore, ifSt[k] is true, then St[k + 1] is true. But we know that St[l] is true, so therefore 
the statement is true for n = 2, 3, 4, . . . all through the counting numbers. 



5 



It is important that St[£] is shorthand for a statement. 
It is not a function or part of an equation. 

In the example above, you can't say St[£] = l 2 + 2 2 + 3 2 + . . . + k 2 
or St[Jt] = \k(k + 1)(2* + 1). 

St[£] is the statement that l 2 + 2 2 + 3 2 + . . . + k 2 = \k{k + 1) (2k + 1). 
St[k + 1] is the statement that 

l 2 + 2 2 + 3 2 + . . . + k 2 + (k + l) 2 = \(k + l)(k + 2)(2k + 3) 

St[^ + 1] is exactly the same as St[£] except that k has been replaced by k + 1. 



EXERCISE 7-D.l 



Try these similar questions yourself. 

(l) When we were working on APs, we found in Exercise 6.B.1 question 2(d) that 
1 + 2 + 3 + 4 + .. • + n = \n(n + l). 
See if you can prove this, using mathematical induction. 



282 



Binomial series and induction 



(2) First, see if you can spot a way of finding the sum of n odd numbers by 
looking at what you get for the first four sums, that is: 

(a) S a = 1 (b) S 2 = 1 + 3 (c) S 3 = l + 3 + 5 (d) S 4 = l + 3 + 5 + 7. 

Then, if you have guessed a formula, see if you can prove it is true by 
mathematical induction. 

(3) Show, using mathematical induction, that 

(1 X 2) + (2 X 3) + (3 X 4) + ... + n(n + 1) = (n/3)(n + + 2). 



7.D.(b) Proving the Binomial Theorem by induction 

As the summit of our ambition for this section, we will now prove the Binomial Theorem 
using induction. 

We have already done the only hard bit when we showed in question (3) of Exercise 7.A.5 
that 



k\ k\ (Jfc + 1) 



r\{k-r)\ (r- 1)1 (k + 1 - r)\ r\(k - r + 1)! 

So we now set out to show that 

n n(n - 1) n\ 

(1 + xf = 1 + — x + x 2 + ... + x r + . . . + x" 

1! 2! r\{n-r)\ 

where n is a positive whole number, by using mathematical induction. 

We first have to check that the statement is true when n = 1 (that is, that St[l] is 
true). 

If n = 1, we get (1 + x) 1 = 1 + x so St[l] is true. 

Now we have to show that if the formula is true when n = a particular value, k, then it 
must also be true when n = k + 1. (That is, we show that, z/StfA:] is true, then St[& + 1] is 
also true.) 

To write down St[&], we must replace n by this particular value k. So St[k] says 

k k(k-l) k\ 

(1 + xf = 1 + — x + x 2 + . . . + x r ~ 1 + 

1! 2! (r- iy.(k- r + 1)! 

x r + . . . + x k . 

r\ (k-r)l 

Notice that we have included the term with x r ~ 1 as well as the one with x r . Can you see 
why? 

St[£+ 1] states that 

(1 +x) k+ 1 = 1 + X + X 2 + . . . + x r + . ..+x k+ l . 

1! 2! r\{k+\-r)\ 

(To write this down, we just replaced V by l k + V in (B2).) 
But (1 +x) k+l = (1 +x)(l +*)*. 

We need to show that the term in x r resulting from this multiplication is the same as the term 
in x r in St[Jt + 1]. 
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But, just as in the examples we have already looked at, the term in x r in (1 + x) (1 + x) k 
comes from 

1 X (the term in x r from (1 + x) k ) + x X (the term in x r ~ 1 from (1 + x) k ). 
So we have to show that 

k\ k\ \ ( (k + 1)! 



r\(k-r)\ (r - 1)! (k - r + 1)!/ \r\ (k + I - r)\ 

but this is exactly what we have already shown in question (3) of Exercise 7.A.5. 
So we know that, ifSt[k] is true, then St[£ + 1] is also true. 

But St[l] is true, and therefore St[2] is true, and St[3] and so on through all the counting 
numbers, and the theorem is proved. 



7.D.(c) Two non-series applications of induction 

The method of mathematical induction is not just restricted to proving results for series. 
Here are two examples of other ways in which it can be used. 

example (l) Show that, if n is a positive integer, 9" -1 is always divisible by 8. 
As always, we test first by putting n = 1 . 

Doing this gives = 9 1 -1 = 9 - 1 = 8 so the statement is true for n = 1 . 

Now we suppose that 9" - 1 is divisible by 8 when n = a particular 
value, k. We can show this by writing 9* - 1 = 8M where M stands for 
some positive whole number. 

Stating that 9 k - 1 = 8M is St[it]. 

We have to show now that, if St[Aj is true, then St[k + 1 ] is also true, 
that is, that 9 k+ 1 - 1 is also divisible by 8. 

Now 9 k+ 1 - 1 = 9(9*) - 1 = 9(8M + 1) - 1 using St[*] to replace 9 k 

by 8M+ 1. 

So 9 k+ 1 - 1 = 12M + 9 - 1 = 12M + 8 = 8(9M + 1). 

Therefore 9 k+l - 1 is divisible by 8. 

We have shown that, z/St[/c] is true, then St[k + 1] is true. 

But St[l] is true so therefore St[2] is true, and so on through all the 
counting numbers. 



The juggling of the powers which we used above by writing 9 k+1 as 9(9*) so 
that we could substitute for 9 k is typical of what works for this type of 
question. 



example (2) Suppose that we have an infinite flat sheet of paper (which is the same 
as a plane in geometry). We then draw straight lines on it so that no 
two lines are parallel and no new line cuts through a point where two 
previous lines cross each other. 

How is the number of crossing points related to the number of lines? 
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Figure 7.D.1 



For example, from the sketches in Figure 7.D.1, 

• one line has no crossing points, 

• two lines have one crossing point, 

• three lines have three crossing points etc. 

Draw separate sketches for four and five lines (remembering that you 
must extend the lines sufficiently far so that all possible crossing points 
are counted). 

Now see if you can find a relationship between the number of lines 
and the number of crossing points. 



You should have got six crossing points for four lines and ten 
crossing points for five lines. 

If you had trouble spotting a relationship, doubling the number of 
crossing points may help you to see the pattern. 



You should then get a possible rule that n lines have jn(n - 1) 
crossing points. 

But we do not know that this is always true; further checking from 
sketches will only show it to be true for as many sketches as we draw. 
(Sometimes the most apparently beautiful patterns break down when n 
is quite large even though they have seemed fine until then.) 

However, we can now show that this formula is always true by 
induction. 

We know that it is true for n = 1 . 

Suppose that it is true for n = k so that k lines do cut each other in 
\k{k- 1) crossing points. This is St[&]. 

Now St[£ + 1] states that k + 1 lines would cut each other in 
\{k + l)(k) crossing points. Does this follow from St[£]? 

The (k + l)th line cuts all the previous k lines in k extra points, so 
drawing in this (k + \)th line gives us a total of jk(k - 1) + k cutting 
points. But 

jk(k - 1) + k = \k((k - 1) + 2) = \k{k + 1) 

so, if St[£] is true, then St[& + 1] is also true. 

But St[l] is true, and therefore St[2] is true, and so on for any 
possible number of lines. 



7.D Mathematical induction 
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In this chapter we look at how it is possible to describe relationships which are 
changing and how we can find out the rate of this change. 
The chapter is split up into the following sections. 

8.A Some problems answered and difficulties solved 

(a) How can we find a speed from knowing the distance travelled? 

(b) How does y = x" change as x changes? 

(c) Different ways of writing differentiation: dx/dt, f'(t), x, etc., 

(d) Some special cases of y = ax", 

(e) Differentiating x = cos f answers another thinking point, 

(f) Can we always differentiate? If not, why not? 

8.B Natural growth and decay - the number e 

(a) Even more money - compound interest and exponential growth, 

(b) What is the equation of this smooth growth curve? 

(c) Getting numerical results from the natural growth law of x = e f , 

(d) Relating In x to the log of x using other bases, 

(e) What do we get if we differentiate In f? 

8.C Differentiating more complicated functions 

(a) The Chain Rule, (b) Writing the Chain Rule as P(x) = f'(g(x))g'(x), 

(c) Differentiating functions with angles in degrees or logs to base 10, 

(d) The Product Rule, or 'uv' Rule, (e) The Quotient Rule, or 'u/v' Rule 

8.D The hyperbolic functions of sinh x and cosh x 

(a) Getting symmetries from e x and e~ x , (b) Differentiating sinh x and cosh x, 

(c) Using sinh x and cosh x to get other hyperbolic functions, 

(d) Comparing other hyperbolic and trig formulas - Osborn's Rule, 

(e) Finding the inverse function for sinh x, 

(f) Can we find an inverse function for cosh x? 

(g) tanh x and its inverse function tanh" 1 x, 

(h) What's in a name? Why 'hyperbolic' functions? 

(i) Differentiating inverse trig and hyperbolic functions, 

8.E Some uses for differentiation 

(a) Finding the equations of tangents to particular curves, 

(b) Finding turning points and points of inflection, 

(c) General rules for sketching curves, (d) Some practical uses of turning points, 
(e) A clever use for tangents - the Newton-Raphson Rule 

8.F Implicit differentiation 

(a) How implicit differentiation works, using circles as examples, 

(b) Using implicit differentiation with more complicated relationships, 

(c) Differentiating inverse functions implicitly, 

(d) Differentiating exponential functions like x = 2 f , 

(e) A practical application of implicit differentiation, 

8.G Writing functions in an alternative form using series 
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8.A Some problems answered and difficulties solved 

What kinds of things can differentiation tell us? I find that sometimes students know some 
rules but don't really know what use these rules are. We start this chapter by looking at some 
examples based on earlier thinking points. In these, we wanted to find answers to what is 
happening in particular physical situations. 

If you see how we can use differentiation to help us here, you will understand better what 
kinds of things it can do for you. 

8.A.(a) How can we find a speed from knowing the distance travelled? 

Suppose somebody is walking at a steady speed of 3 miles per hour (m.p.h.). Then the 
distance travelled for different lengths of time can be shown on a graph sketch like the one 
in Figure 8.A.I. 




Figure 8.A.1 



Since equal distances are covered in equal intervals of time, the speed is represented by 
the gradient of the line, and this can be found by using any of the triangles I have drawn in; 
the size does not matter. 

Any two points (x Y , y r ) and (x 2 , y 2 ) on the line will give its gradient, using the 
formula 

m = from Section 2.B.(d). 

Each of these triangles will give a gradient of 3. This represents the constant rate of 
change of distance travelled or steady speed, of 3 m.p.h. 

But how can we find the speed if the rate at which the distance is covered is continually 
changing? 

This question first came up at the end of the thinking point of Section 2.D.(g), in which 
we looked at how the motion of a ball thrown up in the air changes as time passes. 
Look at this again so that we can use it together now. 

Because of the pull of gravity, the speed of the ball is changing all the while. It is 
moving fastest when it leaves the thrower's hands and when it returns to them; and slowest 
when it comes instantaneously to rest at the highest point of its motion. (We can say that 
it does this because there is an instant in its motion when, rather like the Grand Old Duke 
of York, it is neither moving up nor down.) Between these two extremes, its speed is 
changing smoothly, so that the graph of the distance travelled against the time that this 
has taken is a curve. 
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The last question I asked you in this thinking point was whether you could think of a way 
of estimating the ball's speed one second after it has been thrown up in the air. Surely since 
we can find how far it has travelled at any instant we should be able to do this? 

We used the equation s = ut -jgt 2 to give us the distance s in metres (m), travelled by 
the ball after a time of t seconds (s), if it is thrown up at a speed of u metres per second 
(ms- 1 ). 

In our example, the ball was thrown up at a speed of 14 ms -1 , and we took g, the 
acceleration due to gravity, as 9.8 metres per second per second (ms -2 ). 
This then gave us the equation of s = 14? - 4.9t 2 for the curve. 

I have drawn a new sketch graph, in Figure 8.A.2.(a), showing how the height of the ball 
changes with time over the first 1 .4 seconds of the motion. 
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I have drawn in the separate changes in height for each 0.2 second interval on this graph, 
to give a picture of how the speed is changing. You can see the inaccuracy in this by drawing 
in the slant sides of the triangles yourself. The slopes or gradients of these slant sides are 
giving the average speeds over each 0.2 second interval, but they only give an approximation 
to the actual shape of the curve. It seems reasonable to think that, at any point where two 
adjacent triangles touch it, the steepness of the curve will be somewhere between the 
steepness of the slant sides of these two triangles. 

Taking the equation of the curve as s = I4t - 4.9t 2 , we can make the table (a) below for 
the different values of s. 



(a) 



(b) 



t 


0 


0.2 


0.4 


0.6 


0.8 


1.0 


1.2 


1.4 


s 


0 


2.60 


4.82 


6.64 


8.06 


9.10 


9.74 


10.00 



t 


0.8 


0.9 


1.0 


1.1 


1.2 


s 


8.06 


8.63 


9.10 


9.47 


9.74 



We now use the triangles either side of t = 1 to get estimates of the speed when t = 1 . 
I will call the change in height As and the corresponding change in time At. 
(A is the Greek capital D, pronounced 'delta'. It is often used to mean 'the change in'. 
We have used it this way already in Section 3.A.(b).) 
The left-hand triangle gives 

As 9.10-8.06 

— = = 5.2ms- 1 . 

At 0.2 

The right-hand triangle gives 

As 9.74-9.10 

— = = 3.2ms- 1 . 

At 0.2 

From Figure 8.A.2(a), we believe that 5.2ms- 1 is an over-estimate and 3.2ms- 1 is an 
under-estimate of the speed when t = 1 . 

Next, we try taking smaller time intervals either side of t = 1 . I have done this in table 
(b), and I show the separate changes in height on this small section of curve in Figure 
8.A.2(b). Again, you should draw in the slant sides yourself. 

Taking the two triangles on either side of t = 1 again, the left-hand triangle gives 

As 0.47 

— = = 4.7ms- 1 

At 0.1 

and the right-hand triangle gives 

As 0.37 

— = = 3.7ms" 1 . 

At 0.1 

We see that we are getting closer to an agreement between the estimates. 
Infilling again in the same kind of way gives us the table below. 



t 


0.90 


0.95 


1.00 


1.05 


1.10 


s 


8.63 


8.88 


9.10 


9.30 


9.47 
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I have shown again, in Figure 8.A.3, a magnified picture of the small part of the curve 
which we are considering here. If you now draw in the slant sides of the triangles, you will 
find that they are almost indistinguishable from the curve itself. 

Since the differences are now becoming very small, it seems a good idea to show this by 
labelling them in a slightly different way. I shall use d, which is the small Greek letter d, and 
call the changes ds and dt. d is very commonly used in maths to mean 'a small change in'. 

Now, looking at the two small triangles either side of t = 1 shown in Figure 8.A.3, the left- 
hand triangle gives 

ds 0.22 

— = = 4.4 ms 1 

dt 0.05 

and the right-hand triangle gives 

ds 0.20 

— = = 4ms 

dt 0.05 

So, coming from the left and from the right, we have two sets of approximations which 
are getting closer and closer to the speed at the instant when t = 1. We have 

5.2 -> 4.7 -> 4.4 and 4 <- 3.7 <- 3.2 

This system looks very promising. We can see that the smaller the differences are the better 
the approximation is, so perhaps we should focus on making the differences extremely small 
and see what happens? 

We don't want to specify exactly how small since, for any given interval, we know we 
could always halve that and so get a better approximation. 

So what we will do is to look at what happens to ds/dt, just making the proviso that we 
are letting dt become smaller and smaller. We are snuggling the little triangles in closer and 
closer to t = 1 from both sides. 

Also, it would be much nicer if we could get a rule for finding the speed which works 
for different initial speeds, u, and for the slightly different possible values of g as we travel 
over the earth's surface, so that we don't have to recalculate every time these are different. 
So, instead of taking particular values, we will work with u and g. 

We start with s = ut -jgt 2 and then see what happens to this equation at the nearby time 
of t + dt. 
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If the time has changed by a small amount dt then the distance s will also have changed 
by a correspondingly small amount ds. So we will have 

s + ds = u(t + dt) -\g(t + dt) 2 . 

Now, 

(t + dt) 2 = t 2 + 2t(dt) + (dt) 2 . 

So 

s + ds = ut + M ((50 -jgf 2 - gt(dt) -\g(dt) 2 (1) 
But, at time t, 

s = ut -jgt 2 (2) 

Subtracting (2) from (1) gives 

ds = u(dt) - gt(dt) -\g{dtf 

ds j 
so — = u - gt -2g(dt). 
dt 

But, if we now let dt get closer and closer to zero, it will become so small that we can ignore 
the -\g(dt). 

Because ds is also becoming very small, the fraction ds/dt continues to give the slope of 
the slant side of the little triangle. The smaller this triangle becomes, the closer this slope 
gets to the slope of the curve itself at the point (t, s). 

As dt gets smaller and smaller, ds/dt will become closer and closer in size to u - gt. 

We write this mathematically by saying that the limit of ds/dt as dt — > 0 is u - gt. 



ds 


ds 


The limit of — 


as dt — > 0 is called — . 


dt 


dt 



In this particular example, we have ds/dt = u-gt. We now have a rule to tell us the speed 
at any point on the path of the ball. 



The value of ds/dt tells us the rate of change of s with respect to t for any chosen 
value of t while the ball is still in motion. 

The line with gradient ds/dt which touches the curve at this particular value of t, 
showing its steepness there, is called the tangent to the curve at this point. 



Returning to the particular case of u = 14 and g = 9.8, we can now work out the speed of 
the ball one second after it has been thrown into the air. 

It is given by v = ds/dt = u - gt = 14 - 9.8 = 4.2, so the speed is 4.2 ms 4 . I show this 
on Figure 8.A.4(a). I also show again, in Figure 8.A.4(b), the little sketch of the actual path 
of the ball, which is straight up and straight down. The graph of Figure 8.A.4(a) shows how 
its distance from the ground changes with time. 
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The gradient of the curve at A, that is, of its tangent there, is 4.2. The speed of the ball 
after half a second is 4.2 m s _1 vertically upwards. 

Similarly, if t = 2, dsldt = -5.6. The gradient of the curve, given by the gradient of its 
tangent at B, is negative. The speed of the ball is 5.6 ms _1 vertically downwards. 

Taking the vertically upwards direction as positive, we can say that the velocity of the ball 
(which describes the direction of its motion as well as its speed) is 4.2 m s _1 at A and 
-5.6 ms" 1 at B. 




When you first looked at this thinking point, because the acceleration is 
constant, you may have used the formula v = u + at to find the speed when 
t = 1, putting u — 14 and a = -9.8. This also gives v = 4.2. This method 
works very well in this particular example, but the method we have just been 
looking at above is enormously more powerful because it can cope with 
situations of non-constant acceleration (and much else besides). 



8.A.(b) How does y = x n change as x changes? 

We can now answer this question provided that n is a positive whole number. 

(I am putting in just enough examples here of where these formulas come from to show 
you how they link back to past work, and to justify using them in their hundreds of 
applications.) 

We will look at what kind of small change, dy, we will get in_y if we change x by the small 
amount dx. We have 

y = x" so y + dy = (x + dx)". 

Now, we can expand (x + dx)" using Rule (Bl) from Section 7.A.(e). This gives us 

n(n - 1) 

y + dy = x" + nx"- 1 (dx) + — ^-^ x"~ 2 (dx) 2 
+ terms with higher powers of dx. 
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Putting y = x", and tidying up, gives us 

n(n - 1) 



dy = nx"- 1 (dx) + 



2! 



so 



dx 



= wx"- 1 + x ra " 



2! 



x" (<3x) + other terms with higher powers of dx 



(dx) + other terms with higher powers of dx. 



If we now let dx — > 0, everything except nx" 1 becomes so small that we can ignore it, and 
we have 



& y 

The limit of — as dx — > 0 is nx" . 

dx 



dy 

\iy = x n then — = nx"~ l 



dx 



We know that this result is true if n is a positive whole number because we showed that 
the Binomial Theorem is true in this case. 

Mathematicians have shown that this result is still true if n is any real number, and we 
will use this widened version. 

Multiplying by a constant, a, will just have the effect of multiplying the answer by a. This 
gives us the following general rule. 



dy 

If y = ax n then — = na.\ 
dx 



n-\ 



Doing this process is called differentiating (with respect to x if the function is in terms 
of x, or with respect to t if it is in terms of t etc.). 

If we have a string of terms similar to this which are added or subtracted, we can go 
through differentiating term by term in order to find the total rate of change, so, for example, 
if y = 3x 2 + 2x, then dyldx = 6x + 2. 

8.A.(c) Different ways of writing differentiation: dx/dt, f'(t), x, etc. 

There is another way of writing dyldx, dx/dt, etc. which emphasises more that we are doing 
the process of differentiation to functions. 

In Chapter 3, we used/(x)> an d so on to talk about functions of x and t. 

dy 

If we have y =f[x), then — is also sometimes written as f'{x). 

dx 

dx 

If we have x =fit), then — can also be written as f'(t). 

dt 

Writing x = f[t) stresses that x is a function of the variable t. 

The dash in f(t) means that the function J{t) has been differentiated with respect to this 
variable. 
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In the particular circumstances when x 
notation is used. 

In this notation dxldt is written as x. 



=J[t) is a function of time, sometimes the dot 



lfx=J[t) then 



dx 
dt 



= x=f'(t). 



Historically, the ideas of calculus were developed separately but in parallel by eminent 
(but rivalrous) mathematicians. 

The notation dxldt was used by the German mathematician Leibnitz. 

The notation x was used by the English mathematician and physicist Newton. 

Here are some examples, using the two most usual notations. 



dy 

(1) If y =J[x) = 3x 4 + 2x 3 then — = fix) = 12x 3 

dx 

(2) If* =X0 = 2? + j? 3 then — =/'(*) = 2 4 

dt 

(3) If x =J[t) = 5t 4 4t m then — =f'(t) = 5 



6x 2 . 



3 -t 2 

2 l ■ 



dt 



4 X 5 X r 1/2 



5 4 It 



-1/2 



(4) 



If y =J{x) = 5x + — = 5x + 2x- 2 then — =f'(x) = 5- 4x~ 3 = 5 



dy 
dx 



(If you are unsure about the use of powers here, see Section 1 .D.) 

exercise 8.A.1 Try these for yourself. Differentiate with respect to whatever letter the function is 

written in on the right-hand side. 

(l) y=7x 2 + 3X 4 (2) x = st -\t 3 (3) y = 3 - 2/x 3 (4) x = 2t l/2 + 3t' l/z . 
(5) (a) Show, by thinking about what happens when x is increased by a small 

amount dx, that if / = x 3 then dy/dx = 3X 2 . 
(5) (b) Check what happens at each stage of your working numerically by taking 

the particular case of x = 2 and dx = 0.001. 



8.A.(d) Some special cases of y = ax" 

Students sometimes have difficulty linking the rule for differentiating y = ax" back to these 
two particular cases, so I have put in two examples here to show how this works. 



(1) 



(2) 



If n = 1 then j = ax" is the straight line y = ax. (This is using x 1 = x from Section 
l.D.(b).) 

For example, if y = 3x then, using the rule above, we get dy/dx = 3x° = 3 since 
x° = 1. (This is also in Section l.D.(b).) 

The result of using the rule agrees entirely with what we know to be the 
gradient of the line. (See Figure 8.A.5(a).) 

If n = 0 then we have a very particular kind of straight line of the form y = a where 
a is some number. 

For example, if y = 4 then we can say y = 4x°. 

Now using the rule gives us dy/dx = 0 X 4x _1 = 0. 

Again, this fits in with what we can see to be true in Figure 8.A.5(b). 

The line y = 4 is horizontal and its gradient is zero. 
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Figure 8.A.5 



>3C 



-2 -I o 
-2 



^54 



Two special cases 

dy 

lfy = ax then — = a. 

dx 



lfy = a then 



dy 
dx 



= 0. 



(a stands for any constant number.) 



8.A.(e) Differentiating x = cos f answers another thinking point 

In Section 5.A.(d), we looked at how the point X moves on the line AB as P moves round 
a circle of unit radius at 1 rad/s. You should go back to this now, and answer the questions 
there, if you haven't already done so. Because this particular kind of motion is of enormous 
importance in physics and engineering applications, I will use it as a last example of how 
we can find a rate of change by considering what happens over smaller and smaller time 
intervals. After this, we will use these results as we need them without specifically proving 
any further ones. 

I show the diagram again here in Figure 8.A.6. The final question of this thinking point 
was to find the speed of X after a time interval of t seconds, knowing that the distance OX 
is given by OX = x = cos t. 




Palter timet 



Start 

^ t=0 



OX - oa ~ cos t 
Figure 8.A.6 
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We would also like the answer to tell us whether X is moving from left to right, in which 
case x is increasing and the motion is in the positive direction; or from right to left, in which 
case x is decreasing and the direction of the motion is negative. 

If we can find the speed with its attached + or - sign then we will have found the velocity 
of the point X I have shown the graph of the distance x moved by Xas P goes round its circle 
in Figure 8.A.7. 




Figure 8.A.7 



We know x = cos t. 

How does x change as t changes? 

We saw in the thinking point that X moves fastest as it passes through O and 
instantaneously comes to rest every time it gets to either A or B because then it turns back 
on itself. 

Also, when t = 0, it starts by moving in the negative direction towards O. Its velocity will 
be negative for the first ji seconds of its motion. Also, its velocity changes regularly with 
time just like its distance from O does. 

Do you have any idea how you could write this velocity in terms of tl 



Could it be that the rate of change of Xs distance from O with time, that is dxldt, is equal 
to - sin ft To answer this question, we shall look how x changes if we change t by a small 
amount dt. 

We are again looking at the gradients of the slanted sides of the little triangles as they 
tuck in closer and closer to any particular point Q on the curve x = cos t. I show a possible 
pair in Figure 8. A. 8. 

To find dxldt, we have to find the limiting value of dxldt as dt — > 0. 




Figure 8.A.8 
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If the time changes by a small amount dt, so that the distance changes by a 
correspondingly small amount dx, we have x + dx = cos(t + dt). Which of the formulas from 
Chapter 5 can we use here on the RHS? 



We can use cos(^4 + B) = cos A cos B - sin A sin B (Section 5.D.(b)). So then we have 

x + dx = cos t cos (dt) - sin t sin(dt). 

Now comes the step which only works because we are measuring the angle turned through 
by P in radians. 

In Section 4.D.(e), we looked at some special properties of very small angles measured 
in radians. (Have another look at this section now.) 

We found there that, for a very small angle 9, cos 9 — > 1 as 9 — > 0, and sin 9 — > 9 as 
9 — > 0. So here, cos (dt) — > 1 and sin (dt) — > dt as dt — > 0. Therefore 

as dt — > 0, x + — > cos f - (dt) sin ? 

but 

dx 

x = cos ? so dx — > -((5?) sin f so > - sin ? as (5? — > 0. 

dt 

Therefore we have the following result. 



dx 

If x = cos t then — = - sin t. 

dt 



So the velocity of the point X after time t is - sin t. 

If the radius of the circle is 1 metre then, when X passes through O on its way to B, it 
has a velocity of - sin jt/2 = -1 ms 4 . 

The corresponding result for the curve y = sin t can be shown in a very similar way. Try 
doing this for yourself. This is what you should get. 



dy d 
If y = sin t then — = cos t or — (sin t) = cos t. 
dt dt 



We are now able to get a very interesting result for the motion of X. 
The rate of change of velocity with time is acceleration. But 

d 

— (- sin t) = - cos t = -x. 
dt 

So the acceleration of the point X is always towards O and equal in magnitude to the distance 
of X from O. 

This means that, if X is a particle of unit mass, then the force on X which would make 
it move in this way is also equal in size to the distance of X from O, and always acts 
towards O. 
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These last two results will be unchanged for a larger circle but, if the speed of P is different, 
the relationship will be altered by some constant factor depending on the new speed. 

The point X is moving in what is called simple harmonic motion (SHM). 

A physical example of this is the motion of the bob of a simple pendulum. 

The joint effects of the force of gravity and the tension in the string on the bob produce 
a force on it which gives it an acceleration of the kind we have described. 

We said above that acceleration is the rate of change of velocity with time. 

Also, velocity is the rate of change of distance with time. 

So acceleration is the rate of change of a quantity which is itself a rate of change. 
If we call the velocity v and the acceleration a, then 

dx dv d I dx\ 

v = — and a = — so a = — — . 
dt dt dt \ dt) 

This is written as 

d 2 x 

~d7' 

So, here, we can say that 

d 2 x 

~df~~ X ' 

This is an example of what is called a differential equation. A differential equation is an 
equation which includes terms like dx/dt or d z xldt z . 

We know the solution of this particular example of this equation, which is that x = cos t. 
We'll look at some more equations like this in section 9.C.(c). 



dx 

— is called the first derivative of x with respect to t. 
dt 

d 2 x 

is called the second derivative of x with respect to t. 

dt 2 



This is what happens with the other notations. 

dx d 2 x 

If x =fif) then — =f'(t) = x and =f'(t) = x. 

dt dt 2 

exercise 8.A.2 (1) Do we get the same kind of results if we look at the motion of the point Y on 

the vertical axis described near the end of Section 5.C.(b)? 
The distance OY is given by y = sin r. 

Find for yourself the velocity dy/dt and the acceleration d 2 y/dt 2 of Y. 
Can you link up d 2 y/dt 2 and y by an equation? 

(2) What happens if we have an object moving so that its distance from the origin 
can be described as a combination of sin r and cos f? 

For example, what would happen if we had x = 3 cos r + 4 sin r? 
Find dx/dt and d 2 x/dt 2 , and see if you can find a linking equation between 
x and d 2 x/dt 2 . 
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8.A.(f) Can we always differentiate? If not, why not? 

In all the examples which we have looked at, we have been using the same process of tucking 
the little triangles in closer and closer to the point we are considering on the curve, to get 
better and better approximations to its steepness and so to its rate of change at that point. 
Is it always possible to do this? 

If we have some relationship giving y in terms of x, can we always go ahead and find 

dylchc! 

What kinds of thing might happen which would mean that we could not differentiate y 
with respect to x? 

The graph sketches in Figure 8.A.9 may suggest some potential problems to you. 




Figure 8.A.9 



Also, suppose we can no longer draw the small triangles near some point on a graph 
because tiny differences in x give rise to huge differences in yl Can you think of such an 
example on any of the graphs which we have already sketched in this book? 

Make a list for yourself of all the circumstances which you think will spell trouble for the 
process of differentiating. 



I hope that you will have thought of some of these possibilities. 

In order to differentiate successfully, we must have the following conditions. 

(1) There must be no breaks or discontinuities as at A in Figure 8.A.9(a). 

There is no meaning to the slope at the point where the break is. 

(2) It must be true that, moving in from either side with the little triangles, we get the 
same slope for the tangent that we are considering. The left-hand limiting value 
must be equal to the right-hand limiting value. 

For example, we can't find dyldx at the points B and C in Figure 8A.9(b) 
and (c). 

(3) The graph cannot be infinitely wiggly like a fractal curve where, however small the 
scale you take, the outline is still very similar to the one I have drawn in Figure 
8.A.9(d). A coastline looks much the same in whatever detail you look at it, with 
smaller and smaller inlets being revealed. For a curve like this, it is impossible to 
define the slope at any point on it. 
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(4) It must be true that there is a limiting value to be found, so tiny changes in x don't 
give uncontrollably huge changes in_y. This is what happens, for example, as we get 
closer and closer to x = Jill in the graph of y = tan x. 

Any graph which has some value of x for which the function is undefined 
because it is impossible to divide by zero will give a discontinuity like this. 

Another example is the function J[x) = (x + 3)/(x - 2) which we drew in Figure 
3.B.16 in Section 3.B.(i). It has a discontinuity like this whenx = 2. dyldx does not 
exist for this value of x. 

Unfortunately, it isn't possible to produce watertight definitions of the problems just by 
using pictures. 

For example, in Figure 8.A.9(b), would we be all right if we rounded off the sharp point? 
How rounded off is the balance point of a see-saw? 

How close to the origin can we get in Figure 8.A.9(e) before the wiggles become so 
violent it is impossible to find the slope? 

Suppose we severely squash the horizontal scale on an ordinary sin graph. It will then 
become very wiggly. If we squash it far enough can we make it impossible to find the slope? 
But surely that would be ridiculous? How could differentiation depend on the personal scale 
we have chosen? 

The study of how continuity and differentiability can be defined rigorously to make clear 
just what is possible is what mathematicians call analysis. 

I have tried here to give you enough of an insight into what is happening so that you will 
have a feel of when there might be a problem, and be suitably cautious. 




8.B Natural growth and decay - the number e 

I have found that many students regard e as something of a mystery - something that 
obviously matters a lot in calculus because it is always being used but why? You will know, 
if you are studying science or engineering, that e is involved in many of the equations which 
describe the physical relationships which are important in your subject. This next section 
sets out to give you at least some of the reasons why e is so important. If you are in a hurry, 
you can leave the reading of it until later, but you should go through highlighting all the 
boxes of important results, both so that you can use them now and also to pinpoint them for 
yourself if you want to do more investigation later on. 

I have already described some relationships of natural growth in Section 3 .C. If you want to 
understand how e works, you should start by having another look at this before going on. 
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8.B.(a) Even more money - compound interest and exponential growth 

In Section 6.C.(h) we looked at how it is possible to make invested money grow faster by 
using a system of compound interest so that the new interest is calculated as a percentage 
not only of the original amount of money invested, but also of the interest which has so far 
been accumulated. 

I said there that this updating of interest is usually done either yearly or six-monthly 
Would the shorter time interval make very much difference? We would expect it to make 
some difference because there will be some interest at the end of six months. At the end of 
the year, you would receive interest on this interest as well as the interest on the original 
amount of money which you invested. 

If this case, how much better would it be to have an even shorter time interval, say three 
monthly? 

This is an important question to answer because rates of growth which depend on how 
much of a quantity is present at any particular time are very important in many real-life 
physical situations. 

Rather than returning to the situation of Section 6.C.(h), we will look at a slightly 
different picture. It turns out to be particularly interesting to start from the special case of 
what happens when the extra amount or interest received at the end of a unit time interval 
is equal to the amount originally saved. 

Unfortunately, this is an unlikely arrangement for a bank to make, so we shall look at the 
following example instead. 

Suppose there is a group of cousins who each receive £100 from their wealthy uncle one 
Christmas. So strongly does he feel about the virtues of prudence and thrift that he says he 
will arrange things so that their savings increase at an equal rate to the amount saved, so that 
if the £100 is saved until the following Christmas, he will then add a further £100 to it. 

All five cousins decide that they will save their £100. 

The first cousin is happy to look forward to receiving the extra £100 the next Christmas, 
which will then give him a total of £200. 

The second cousin decides to capitalise on her uncle's offer by suggesting that he increase 
her savings by a system of compound interest. She will split the year into two halves. Her 
uncle will give her £50 at the end of the first half-year, so she will have £150. 

Since she will then be saving £150 instead of £100, at the end of the second half-year she 
will get an extra £75 instead of just £50, so giving her a total of £225 at the end of the 
year. 

Her uncle agrees, so we can write this in the same form which we used with compound 
interest in Section 6.C.(h). 
We have 

Start Mid-year End of year 

£100 -» £100 + 5 (£100) -> [(£100 + \ (£100)) + \ (£100 + \ (£100))] 

or 

£100 ^ (1+|) £100 -> [(1 +5) £100 + 5(1 +5) £100] 
which tidies up as 

£100 -» (1 + \) £100 -> (1 + \f £100 = £225. 

The two steps in her savings are given by the second and third terms of a geometric 

1 3 

progression (GP) which has a first term of £100 and a common ratio of (1 + j) = 1- 
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The third cousin, seeing this calculation, considers that having the interest updated 
quarterly would be even more beneficial. 

The pattern for her quarterly updates will go 

£100 -> (1 + J) £100 -> (1 + \f £100 -> (1 + \f £100 -> (1 + \) A £100 
Start 1st quarter 2nd quarter 3rd quarter End of year 

giving her a total at the next Christmas of £244.14 to the nearest penny. 

Again, the four steps in the savings are given by a GP, wth a common ratio this time of 

(!+?) = !■ 

How much would the fourth cousin (who negotiates monthly updates) get by the end of 
the year? 



He would get (1 + yj) 12 £100 = £261.30 to the nearest penny. 

This time, the twelve steps of the savings are given by a GP which has a common ratio 

of (i + n) = If- 

The fifth and youngest cousin is keen to see how much she can negotiate to get. 
Try estimating for yourself how much you think she might get. What do you think her 
best arrangement would be? 



She decides to go for the most extreme position and says 

'If I am saving as you want, could we not consider that, over the year, the money that you 
will give me becomes more and more mine, and so it can really be considered as feeding in 
continuously to become part of my savings as the year goes by. And then I shall be getting 
a rate of increase equal to the total amount I have saved all the while. Since we are reckoning 
here on infinitely small time intervals, I shall do infinitely better than any of my other 
cousins!' 

Is she right? 

If we look at what happens as the time intervals become shorter, we find the 
following: 

Weekly updates would give her a final total of (1 + ^) 52 £100 = £269.26. 
Daily updates would give her a final total of (1 + 3^) 365 £100 = £271.46. 
Hourly updates would give her (1 + gy^) 8760 £100 = £271.81. 
See for yourself what happens if the interest is updated every minute. 
The amounts are increasing, but more and more slowly. 

Now we know that the increases for the first four cousins are all coming in definite steps, 
and we saw that each scheme was described by a different GP. 

The increases given by updates every minute are still described by a GP, this time with 
525 600 steps, and a common ratio of 

1 525 601 

1 + = . 

525 600 525 600 

The steps are now exceedingly tiny, but they are still there. This GP would give a grand total 
at the end of the year of £271.82 to the nearest penny. 

When the youngest cousin gets what she wants, the steps will have been smoothed out 
to give a continuous growth curve. We know that her £100 will have been multiplied by a 
factor of about 2.7182 by the end of the year. 

What is this number which is equal to about 2.7182? 
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To find the answer to this, we'll now look at the pattern of her increases as the time 
intervals get shorter and shorter. These go 

£100 ->(!+-) £100 -H 1 + - 1 £100 ->...-> ( 1 +-] £100 



ni \ n) \ n 



where n is as large a number as we care to think of, as she is breaking her year into infinitely 
short time intervals. So she finishes up with 

1\» 

1 + - £100 as n -> ». 



n, 

Now, we can do a binomial expansion on 
1\» 

1+ " ■ 

nj 

We use the formula (B2), from Section 7.A.(e), which starts 

n n(n - 1) n(n - \){n - 2) 

(1 + x) n = 1 + — x + x z + x 3 + . . . 

1! 2! 3! 

We have to put x = l/n, where n is a positive whole number, but a very large one indeed. We 
get 

« / 1 \ «(« - 1) / 1 \ 2 + n(n - 1)(« - 2) / 1 y + 



1! W 2! W 3! 

and, as w becomes larger and larger, « - 1, n - 2, etc. are all relatively close to n. 
We are getting nearer and nearer to the series 

1111 

1 + — + — + — + — + ... 
1! 2! 3! 4! 

as the amount by which we must multiply the £100 to find her total savings. 

As we go further and further in summing this series, we find that the running sum gets 
closer and closer to a value of about 2.71828, so she gets £271.83 to the nearest penny, doing 
the best of the cousins, but not dramatically better than her next cousin. 

This number, to which the pretty series 

1111 

1 + — + — + — + — + ... 
1! 2! 3! 4! 

converges is extremely important mathematically, and is indeed the famous e. 

You can see its value to as many places as your calculator will allow, by putting in 1 and 
then pressing e x . 

We now have this important result. 



/ 1\» 


1 


1 


1 








+ — - 


l- — + . 


. = e. 


As n — > oo, 11 + — J -> 1 ^ 


1! 


2! 


3! 





We have found in this section that, when the interest is updated at the end of equal time 
intervals, so that the total amount of money is increasing in separate jumps, then these 
increasing amounts of money form the terms of a GP (with a different GP for each set of 
equal time intervals). 
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Figure 8.B.1 



However, when the interest is updated continuously, so that the amount of money saved 
is increasing smoothly all the while, the result is no longer described by the steps of a GP 
but by a smooth growth curve. 

You can see these differences in Figure 8.B.1 where I show the growth in the savings of 
the second, fourth and youngest cousin. 

8.B.(b) What is the equation of this smooth growth curve? 

In order to be able to apply the mechanism of this smooth growth curve to other situations, 
we need to know what its equation is. 

It becomes easier to see what this must be if we look at how the differences between the 
graphs are building up at an intermediate point. 

For example, after six months we have the following totals. 

The first cousin still has £100. 

The second cousin has (1 + 5) £100 = £150. 

The third cousin has (1 + \f £100 = £156.25. 

The fourth cousin has (1 + jj) 6 £100 = £161.65. 

We can emphasise that we are considering a half-yearly interval here by writing 

(1+5) 2 = [(1+5) 4 ] 1/2 and (1+^) 6 = [(1+^) 12 ] 1/2 

so, for example, the fourth cousin has 

[(1 +n) 12 ] 1/2 £100 = £161.65. 

It then seems reasonable to say that, at the end of the half-year, the fifth cousin would 
have 

[(1 + b"] 1 ' 2 £100 = e m X £100 since, as n », (1 + \f e . 

Now, the accumulating totals for the first four cousins increase in definite jumps, but the 
total for the fifth cousin is increasing smoothly, so it would seem reasonable to say that, after 
a time interval of any length t, where t is measured in years, she would have a total of 
e' X £100. 
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Her smooth growth curve has the equation x = 100 e' where t represents the time interval 
along the horizontal axis and x represents her total savings in £ s. 

Because the rate of increase of e' is equal to e 1 itself for any value of t, it must be true 
that 



d 

— (e<) = e*. 
dt 



This property of e* that its rate of change is always equal to e l itself makes it very 
special. 

If you tried drawing the sketch in the thinking point of Section 3.C.(e), you should have 
found that the gradient of the tangent when x = 1.5 was about the same as the height of the 
curve for that value of x. 

8.B.(c) Getting numerical results from the natural growth law of x = e f 

I have taken the simplest possible form of the natural growth law here, leaving out the 100 
which we included for the £100 earlier, to make this section simpler. 
Starting from x = e', see if you can answer the following questions. 

(1) What isx if (a) t = 2 (b) t = \ ? 

(2) What is t if (a) x = 1 (b) x = 2 (c) x = 4.5? 

To help you, I have shown these questions in Figure 8.B.2. (You will need to use your 
calculator to get the answers.) 




Figure 8.B.2 



(1) This is straightforward. Using x = e', we have 

(a) x = e 2 so x = 7.3891 to 4 d.p. using a calculator 

(b) x = e m so x = 1.3956 to 4 d.p. 

The first answer corresponds to the amount of money, measured in units of 
£100, which the fifth cousin would have after two years (if her uncle leaves the 
system of growth unchanged). This would be £738.91. The second answer 
corresponds to the amount she would have after j of a year or 4 months. This is 
£139.56. 
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This question is a bit more tricky because we want to go back the other way. We 
need to use the inverse function which will take us back from x to t. 

Because of the way it was obtained, the growth curve is smooth and has no 
gaps, so there will be a value of x for any particular value of t. 

We define the inverse function by introducing the natural log and saying 



if x = e* then t = log e x or In x. 



(Natural logs, that is logs to the base e, are usually written as In rather than log e .) 
This now gives us the answer for question (2)(a) that t = In 1 = 0 so 1 = e° which 
agrees with the meaning we gave to the power 0 in Section l.D.(b). 

It also agrees with the starting amount of money of 1 X £100 when t = 0. 

The answer for question (2)(b) is t = In 2 = 0.693 to 3 d.p. using a 
calculator. 

The fifth cousin would have £200 after 0.693 X 12 = 8.3 months 
approximately. 

Question (2)(c) has the answer of ln(4.5) = 1.504 to 3 d.p., giving the fifth 
cousin £450 after approximately l\ years. 

If we have a function x =J[t), then we write its inverse function (if it exists) in 
the form x =f~ 1 (t). 

Here, we have/(0 = e' and f~ l (?) = In t. 

Since doing the function followed by doing the inverse function brings you back 
to where you started, we have 



f- l (f(t)) = t and At 1 (t)) = t. 



For the particular functions of/(0 = e 1 and f 1 {() = In t, this gives us 



In (e') = t and e lnt = t. 



These equations are extremely useful and are worth surrounding in bright 
colour. 

I have sketched x =J(t) = e* and x =f~ l (t) = In t in Figure 8.B.3. 
Notice the following points here. 

• The sketch includes negative values of t. If t represents time, then these represent 
times before we started doing the measuring. 

• The value of e' is always greater than zero although, for large negative values of 
t, it gets infinitely close to zero. 

• We can only find the natural log of a positive quantity. (This is true for any log.) 
This agrees with 2~ 3 , say, being 1/2 3 = 1/8. 
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Figure 8.B.3 



8.B.(d) Relating In x to the log of x using other bases 

Starting from a similar situation in Section 3.C.(b), we defined the inverse function of 
f{x) = T as/- 1 (x) = log 2 x. 

It will now be of great practical importance to us to find a rule which will tell us how 
to write logs to other bases (in particular, base 10) in terms of logs to base e (or natural 
logs). 

To find this rule, we will start with some number a and suppose that log 10 a = y and 
In a = log e a = x. (In this section on changing bases, I will write the natural logs as log e rather 
than In to emphasise that these are logs to the base e.) 

If log 10 a = y then a = 10 y and if log e a = x then a = e x . (This is what 'base 10' and 'base 
e' mean.) 

But it must also be possible to write 10 itself as a power of e. 

Let's say that 10 = e c . This means that we can say that c = log e 10. 

(Using my calculator gives me c = 2.302 585 093 but this is only an approximation to nine 
decimal places. Any further rounding off will make it even more inexact so we'll carry on 
calling it c for short.) 

Now we say that a = W = (e c J = e cy . 

But also a = e x so now we have e x = e cy so x = cy. 

Putting back what x, y and c are in terms of logs gives us 



log e a = (log e 10) (log 10 a) or In a = (In 10)(log 10 a). 



It is also worth surrounding this in bright colour. 

We now have a rule which makes it possible for us to change a log to base 10 into a 
log to base e. (One way of remembering it is to think of it as sort of 'cancelling' the 10.) 
Try choosing some particular values for a and than check on your calculator that the rule 
does work. 
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Being able to write logs to base 10 in terms of logs to base e (that is, natural logs) will 
be very important when we want to find the rates of change of functions of logs to base 10. 
We shall see how to do this in Section 8.C.(c). 

The rule above can be extended to cover any change of base, say from m to n. 



log„ a = (log„ m) (log m a). 



This rule gives us a special case which is sometimes quite useful. If we put n = a and 

m = b, we get 



1 

log a a = 1 = log„ b log h a so log a b = . 

log 6 a 



We have now seen how logs to other bases can be converted into natural logs. It is 
possible to define all other logs and powers in terms of logs and powers of e, and this is done 
in the rigorous approach of mathematical analysis. It is then possible to give a meaning to 
such unnerving quantities as 2", for example. Doing this properly is a slow and careful 
process. In this book I try to give you enough examples of places where you need to be 
careful, to help you to understand why this detailed analysis is done. 

8.B.(e) What do we get if we differentiate In tl 

What is the rate of change of x = In t with respect to f! 
That is, what is dxldft 

dt 

If x = In t then t = e x so — = e x . 

dx 

But it seems reasonable in general to say that 

dx 1 

dt dtldx 
since we can say that the fraction 

dx _ 1 

dt dtldx 

Provided that none of the problems talked about in Section 8.A.(f) is present, then when 
dt — > 0, dx — > 0 also, so this step is justified. Now here we have 

dt dx I I 

— = e x so — = — = — . 
dx dt e x t 

This gives us the enormously important result that 



dx 1 

if x = In t then — = — . 

dt t 



This is another box worth surrounding with bright colour. 
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I should point out here that the letters we use are not important in themselves; they are 
just names or tags. 

So it is equally true, for example, that 



dy 1 
if y =fix) = In x then — =f'(x) = —. 

dx x 



8.C Differentiating more complicated functions 

Before we start looking at ways of how we can do this, I will collect together in a box all 
the functions we can now differentiate. Remember that the letters of the variables can be 
changed as you wish. (I have used y, x, t and 6 for mine.) 



Rates of change we already know 

(1) If y =fix) = ax" then dyldx =f'(x) = nax"' 1 . 

So if y = ax then dyldx = a and if y = a then dyldx = 0 (a stands for any 
constant number). 

(2) If x=J[t) = sin t then dxldt = f'(t) = cos t. 

(3) If x =J[t) = cos t then dxldt =f(t) = - sin t. 

(4) If x =J{t) = e' then dxldt =f'(t) = e'. 

(5) If x =J{t) = In t then dxldt =f(t) = lit. 



I have used the letter / for a function here, all through, but of course you can use other 
letters if you want. 



^^^^ Students sometimes mix up the minus sign in (2) and (3). There are two 
^H-^V ways you can use to remember that the minus sign comes when you 
differentiate a cos. 

• Remember the shape of the first bit of the sin and cos graphs. 

The cos graph is going downhill here, so dldt (cos t) must be - sin t. 

• Sin Differentiates Plus so Solve Damn Problem. 



8.C.(a) The Chain Rule 

It is often necessary to be able to find the rate of change of functions which have been built 
up from simpler ones. For example, we might have x =J(t) = sin 3t or y =j{x) = (3x 2 + 2) 5 
or y =J{6) = sin 3 26 etc. The Chain Rule gives us a way of dealing with all of these. 
I will explain how this works by showing you the following four examples. 

(1) y = (3x 2 + 5) 5 (2) x = sin(3f + tt/2) (3) y = e x2+2 (4) y = ln(2f 2 + 3?) 

Each of these is built up from functions which we can easily differentiate. 
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We can show this in the following way. 

(1) y = (3x 2 + 5) 5 becomes y = X s if we put X = 3x 2 + 5. 

(2) x = sin(3? + jt/2) becomes y = sin X if we put X = 3t + ji/2. 

(3) y = e x +2 becomes y = e x if we put X = x 2 + 2. 

(4) y = \n(2t 2 + 30 becomes v = In X if we put X = 2t 2 + 3t. 

In each of these, X stands for a whole lump or chunk which makes a second function. 

Taking example (1), we have y as a function not just of x but also of this X which is itself 
a function of x. 

It is for this reason that the Chain Rule is also known as 'function of a function'. 
Being able to write y in this way makes the finding of dyldx very much simpler because 
we can split it into two easy steps. 

We justify this by going back to the stage of the very small changes, and saying 

dy dy dX 

— = just using the ordinary rules of fractions. 

dx dX dx 

Now, provided none of the potential difficulties which we talked about in Section 8.A.(f) are 
present at any of the points we are interested in, so that as dx gets very small we also have 
dX getting very small, we can say that 

dy dy dy dy dX dX 

as dx — > 0, > — , > — and > — . 

dx dx dX dX dx dx 

This gives us the following result. 



The Chain Rule 

dy dy dX 

If y is a function of X, and X is a function of x, then — = . 

dx dX dx 



Using this now in each of the four examples which we had above, and changing the letters 
when necessary, we get the following results. 

dy dy dX 

(1) — = = (5X 4 ) (6x) = 30x (3x 2 + 5) 4 

dx dX dx 

Notice that I have given the final answer in terms of the original x. You should always do 
this. 

dx dx dX 

(2) — = = (cos X)(3) = 3 cos(3f + 7r/2) 

dt dX dt 



Remember here that Jt/2 is a constant, and so gives zero when it is 
differentiated. 



dy dy dX , 
(3) — = — — = (e x ){2x) = 2xe* +2 
dx dX dx 
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Using the Chain Rule also gives us the result that d/dt(e~ ( ) = -e~ l . This describes a process 
of decay where the rate of change of the substance present at time t is equal to minus the 
amount of the substance present at that time. The minus sign shows that this rate of change 
is negative, and the amount of the substance present is decreasing. 



You will avoid a lot of mistakes if you remember that if e x is differentiated 
with respect to X then the answer is e x . 

SO if yOU have e somethin 8 complicated then e the same something complicated mugt bg 

part of your answer when you differentiate. 



(4) 



dy dy dX 
dt dX dt 



1 \ At + 3 

- (At + 3) = — 

X It 2 + 3? 



EXERCISE 8.C.1 



Try these questions for yourself now. 

It is very important to be able to do these differentiations quickly and reliably 
because they will be the basic step of many further processes. (In particular, when 
you come to use partial differentiation, which involves having functions of more 
than one variable, you need to be able to do this process without any worries.) 

For this reason, I start off with easy questions and build them up gradually so 
that you can get really confident with them. 

I think you will find that quite quickly you can work with the X in your head, 
just writing down the two multiplied bits and then tidying them up for the final 
answer. 

Differentiate each of these functions with respect to the letter used in their 
description. 



(i) y = (2x 2 + 3)4 
(4) x = (3f + 4) l/2 
(7) y = 2e x ' +x 
(10) y = sin(x 2 ) 



(2) x = (f 3 + 2) 5 (3) y = (3X 2 - 2x) 4 

(5)/ = 3e 4X (6)x = e p + 1 

(8) x = cos(4f + Ji/3) (9) x = sin f + sin 2f 
(11) y = sin 2 x, which means (sinx) 2 . Hint: let X = sin x. 



(12) y = cos 3 x (13) y = In 4X (14) y = ln(3x + 1) (15) x = ln(2f 2 + 1) 
(16) y = cos(5x 2 + jt) (17) x = sin(2f 2 + 3) (18) y = ln(sin x) 



The next step is to be able to use the Chain Rule more than once in the same question. 
With practice on the easy ones (which are then often built into more complicated ones), you 
will find this no problem. 

Try these ten quickies now, doing the X part in your head. 



exercise 8.C.2 Differentiate each of the following with respect to x. 

(1) / = e 5X (2) y = e~ 2x (3) y = e x * (4) y = ln(2x + 3) 

(5) y = ln(i + x) (6) / = ln(i - x) (7) y = sin jx (8) y = cos 4 x 

(9) y = sin(2x + n) (10) y = cos(3x + 4) 

Now we are ready to do the functions of functions of functions. (In fact, you can chain 
together as many as you like, with them all folded inside each other like a set of Russian 
dollies.) 
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Here are two examples. 



example (1) Find dyldx for y = sin 3 (Ax) (which means, of course, y = (sin(4x)) 3 . 

We think of this first as y = X 3 , with X = sin Ax, and write 
dyldx = (3X 2 )(4 cos Ax) = 12 cos Ax sin 2 Ax. 

The second use of the Chain Rule, on the sin Ax, has become so 
automatic that you hardly notice that you are doing it. 

example (2) Find dyldt if y = ln(sin 3t). 

Thinking of this as In X, with X = sin 3t, we can write 

/ 1 \ 3 cos 3? 

dyldt = — (3 cos 3?) = = 3 cot 3t. 

\X) sin 3t 

exercise 8.C.3 Try these now for yourself. Differentiate each function with respect to the letter 

used in their description. 
(1) y = cos 5 2x (2) y = sin 3 (4X + 1) (3) x = ln(sin(2f + 3)) 

(4) x = (2 cos 26 + s) 3 (5) / = ln(i + cos x) (6) x = ln(3f + sin 2 3O 
(7) x = ln(2 + e f2 + 1 ) (8) y = sin(cos 4x) (9) y = (1 + sin 2 f) l/2 

(10) y = ln[(i + sin 2 f) l/2 )] (This is easier than it looks. Think!) 

8.C.(b) Writing the Chain Rule as F(x) = f'(g()Q) g'(x) 

You may come across the Chain Rule written in the dash notation for functions as above. It 
means exactly the same thing as what you have just been doing. I will show you how this 
is so by taking an example. 

Suppose we want to differentiate y = (3x 2 + 2) 4 with respect to x. 

Because we are using function notation, we need to label the three functions involved 
here with different letters. 

I shall let y = F(x) = (3x 2 + 2) 4 . 

Now y is also a function of (3x 2 + 2). (This is what we have been calling X.) 
I shall let X = 3x 2 + 2 = g(x), to show that it also is a function of x. 
Since y is a function ofX, we can also write y =J[X) =J[g(x)). (In this particular example, 
y=A^ 2 + 2).) 

Next, it is important to be sure what the dash notation means for a function. 
f(x) means the function J[x) differentiated with respect to x, 
f'(t) means the function fit) differentiated with respect to t. 

f'(X) means the function f[X) differentiated with respect to X, even though X is itself a 
function of x, so f'(g(x)) means the function f[g(x)) differentiated with respect to g(x). 
It corresponds to what we have called dy/dX. 

So, in this particular example, g'(x) = 6x and f'(g(x)) = A(3x 2 + 2) 3 . So we have 

F'{x) = [4(3x 2 + 2) 3 ] [6x] = 24x (3x 2 + 2) 3 . 
Use whichever notation you prefer. 

8.C.(c) Differentiating functions with angles in degrees or logs to base 10 

When we showed that 

d 

— (cos t) = -(sin t) 

dx 

in Section 8.A.(e) everything ran smoothly because the angle t was in radians. 
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How could we find the slope of the graph of x = cos 0 if 0 is in degrees? (We know that 
we can draw the graph of x = cos 0. The only difference is that the horizontal scale will be 
in degrees instead of radians.) 

In order to find dxIdO from x = cos 9 we shall first have to convert 0 to radians. 

From Section 4.D.(a) we have 

/ 71 \ (3t0\ 

6° = 0 radians, so x = cos 

\ 180/ \ 180/ 

with the angle now in radians. 

We also know from the Chain Rule that, if a is some constant number, and x = cos (ad), 
then dxldd = - a sin 9. 

Exactly the same principle works here with a = tt/180. 

/ JlQ \ dx 71 I Ttd \ 71 

If x = cos then — = sin = sin 0 

\ 180/ de 180 \ 180/ 180 

writing the angle again in degrees. 

The tt/180 is the gearing mechanism or scale factor which lets us have the horizontal 
scale in degrees instead of radians. 

We can use a similar process to differentiate a function in terms of logs to base ten (or 
any other base, but ten is the one you are most likely to want to use). 

To do this, we go back to the relationship between logs to base e and logs to base 10 
which we found in Section 8.B.(d). This says that 

In a = (In 10)(log 10 a). 

So, for example, if we want to find dyldx for the function y = log 10 x, we rewrite this as 
In x 

y = • 

In 10 

Now 1/ln 10 is just a number, so we have 

dy 1 /1\ 1 

dx In 10 \x) x In 10 ' 

Here, the l/(ln 10) is acting as a gearing mechanism or scaling factor which makes the 
differentiation work in the slightly altered circumstances of a different base. 
We now have the following two rules. 



To differentiate functions involving degrees, convert first to radians. 

To differentiate functions involving other logs, convert first to natural logs. 



8.C.(d) The Product Rule, or 'uv' Rule 

The Product Rule moves us a further step on in being able to differentiate functions which 
are built up from simpler functions. It is therefore another technique we will need for 
practical applications. 

As its name suggests, it gives us a way of dealing with two functions which are multiplied 
together to give a third function. 

For example, suppose we have y = fix) = 3x 2 sin 2x. 
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The function /(x) is made up of two functions, u(x) = 3x 2 and v(x) = sin 2x, which are 
multiplied together. So we can say y = uv. 

If we alter x by a small amount dx then y will also alter by a small amount by. Also the 
two components, u and f, of j will each alter by small amounts since they are also functions 
of x. (We are assuming here that none of the complications of Section 8.A.(f) is present.) So 
we can say that u alters by the small amount bu and v alters by the small amount bv. 

This gives us 

y + dy = (u + bu)(v + bv) = uv + v (bu) + u (bv) + (<5w)((5t/). 
But y = uv, so 

(5y = u (bv) + v(bu) + (bu)(bv). 
Dividing all through by bx gives 

by bu bv (bu)(bv) 

— = v — + u — + . 

bx bx bx (bx) 

Now, if we make bx become smaller and smaller, so bx — > 0, then bu and bv will also 
become very small. 
This means that, 

(bu)(bv) 

as bx — > 0, > 0 also. 

(bx) 

Two very small things multiplied together and then divided by one very small thing give a 
very small result. This result will become closer and closer to zero as bx itself becomes 
closer and closer to zero, so we now get the result that 

by dy 
the limit of — as bx — > 0 is — . 

bx dx 

This gives us the following. 



The Product Rule 

dy du dv 
— = v — + u — 
dx dx dx 



In the particular example that we started with, duldx = 6x and dv/dx = 2 cos 2x so we have 
dyldx = (sin 2x)(6x) + (3x 2 ) (2 cos 2x) 

= 6x sin 2x + 6x 2 cos 2x = 6x (sin 2x + x cos 2x). 
The Product Rule can also be written in function notation as 



if j = uv then y' = vu' + uv' . 



This covers the case of y, u and v all being functions of x, or all being functions of any 
other letter which it might be convenient to work with. 
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exercise 8.C.4 Try these for yourself, tidying up all the answers as far as possible. Find dy/dx for 

each of the following. 
(1) y = jx 2 cos 3X (2) y - e 3X sin ix (3) y = 4X 5 (x 2 + 3P 

Find dx/dt for each of the following. 
(4) x = e 2t + 1 cos (2f + 1) (5) x = 7f 2 In (2f - 1) (6) x = (f 2 + i) l/2 sin (2f + tt) 

(7) Find c/y/c/x if y = (x 2 + 1) 5 e 3 * cos 2x. 

If you have three functions multiplied together like this, there is no special new 
Product Rule which you should use. You just bunch any two of the functions 
together and then use the Product Rule twice. 

Here, you could say y = [(x 2 + 1) 5 ] [e 3 * cos 2x] and go on from there. 

In the following questions, you will need to remember that 

d 2 x d (dx 

means — — 

dt 2 dt \ dt 

so to find d z x/dt 2 you differentiate twice. 

These questions are included here not just as practice in differentiating but 
because, if you have seen them working this way round, they will then be easier 
for you to solve when you come to do the opposite process in real-life physical 
applications. There you will be starting with the differential equation (that is the 
equation which has the terms in d 2 x/dt 2 and dx/dt) and finding a solution which 
fits it. 

dx d 2 x 

(8) If x = (2 + t)e 3 ' find (a) — and (b) . 

dt dt 2 

d 2 x dx 

Show that 6 — + ox = o. 

dt 2 dt 

(9) If x = e kt where k stands for some constant number, find (a) dx/dt and 
(b) d 2 x/dt 2 . 

d 2 x dx 

If 2 3X = o find the two possible values of k. 

dt 2 dt 

(10) If x = Ae 3t + Be' 1 , where A and B are standing for constant numbers, show that 

d 2 x dx 

2 3X = o. 

dt 2 dt 

(There is a very quick way to do this one; look at your answer to the previous 
question.) 

dx 1 

(11) If x = e _t In (1 + e f ) show that — + x = 



8.C.(e) The Quotient Rule or 'u/v' Rule 

This rule gives us a good way of differentiating a function which is made up of two simpler 
functions written as a fraction. 

We start with a function y =fix) = u/v where u and v are both themselves functions of x. 

The following result can then be shown by a very similar argument to the one we used 
for the Product Rule in Section 8.C.(d). 



8.C More complicated functions 



315 



The Quotient Rule 

u dy v (du/dx) - u (dv/dx) 

If y = — then — = 

v dx v 



5 



Notice the minus sign in the middle of the Quotient Rule. Because of this it 
matters what order the top two bits are written in. This is why I wrote the 
Product Rule in the same order. Then V comes virst' for both. 



Because the Quotient Rule automatically tidies up the answer by putting it 
over a common denominator, I think that it is easier to use it for a function 
like y = j{x) = 2x/(3x - 1), rather than writing this as 2x(3x - 1) _1 and then 
using the Product Rule. 



Here are two examples of using the Quotient Rule. 

example (1) We can use it to find out what the answer is if we differentiate y = tan x 
with respect to x. We write 

sin x 

y = tan x = so u = sin x and v = cos x. 



COS X 



So 



But 



dy (cos x)(cos x) - (sin x)(- sin x) cos 2 x + sin 2 x 
dx 



cos 2 X 



cos 2 X 



cos 2 x + sin 2 x = 1 so 



Therefore, we have 



dy 1 



dx cos 2 x 



= sec x. 



d 
dx 



(tan x) = sec 2 x. 



example (2) We will use the Quotient Rule to find dyldx if y = 

u = x + 3 and v = x - 2 so we get 

<fy = fr- 2)(1) - (x + 3)(1) 5 
dx 



x + 3 
x - 2 



(x-2) 2 (x-2) 2 

This is undefined when x = 2, but otherwise it will always be negative 
since (x - 2) 2 must be positive. 



316 



Differentiation 



The value of dyldx at any particular point of a curve is telling us the 
slope of the curve at that point. You can see how it tallies with the 
shape of the curve for this particular function because we sketched it in 
Section 3.B.(i). We thought there, from the information that we then 
had, that this curve should always have a negative slope except where 
x = 2 when y itself was undefined. Now we see that this is indeed true! 

Knowing dyldx gives us a rule for finding the slope at any particular 
point of the curve. We can see this here by taking a couple of examples 
of points on this curve, say^, (3,6), and B, (- 4, -g). 

dy dy 5 

We get at A — = -5 and at B — = - — . 

dx dx 36 

These gradients agree well with the sketch; we can see that the tangent 
at B would be much less steep than the tangent at A. 



The Quotient Rule can also be written in function notation like this. 



u vu - uv 
If y = — then y' = 



exercise 8.C.5 Try these questions yourself now. 



cos x d 

(1) By writing cot x = show that — (cot x) = - cosec 2 x. 

sin x dx 

i d 

(2) By writing sec x = or (cos x) \ show that — (sec x) = sec x tan x. 

cos x dx 

d 

(3) Show similarly that — (cosec x) = - cosec x cot x. 

dx 

dx sin r 

(4) Find — if x = 



dt 1 + cos f 



d 

(5) Show that — (In (sec x + tan x)) = sec x. 
dx 



dy e x - e 
(6) Find — if y = — 



dx 



dy 1 1 + x 

(7) Find — if y = In 



dx \ 1 - x , 

(Think how you can make this one simpler to do!) 

dy x 2 sin x 

(8) Find — if y = . 

dx (3 - x) 
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(9) Find — if / = with x * -\. 

dx ix + 3 

dy ax + b 

(10) Find — if y = 

dx cx + d 

where a, b, c and d are all constant numbers, and x * - d/c. Are there any 
values of x which make dy/dx = 0? 



Here is a summary of the new useful results we now have. (We also have the box of results 
at the beginning of Section 8.C.) 



More rates of change we now know 

• If y = tan x then dy/dx = sec 2 x. 

• If y = cot x then dy/dx = - cosec 2 x. 
•lfy = sec x then dy/dx = sec x tan x. 

• If y = cosec x then cfy/c/x = - cosec x cot x. 

• If y = ln(sec x + tan x) then ofy/dx = sec x. 



It is worth highlighting this box because, when you come to do the process of 
differentiation the opposite way round in the next chapter, being able to spot these will be 
very helpful to you. 




8.D The hyperbolic functions of sinh x and cosh x 

Now that we know the Chain, Product and Quotient Rules for differentiation we are able to 
look at an interesting extension of the two graphs of y = e x and y = e~ x . 

8.D.(a) Getting symmetries from e x and e~ x 

The graph of y = e x is not symmetrical and neither is the graph of y = e~ x , and yet the two 
graphs shown together have a striking mutual symmetry which is clear from Figure 8.D.I. 
This is because each is the mirror image of the other in the j-axis. 
Can we exploit this? 
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Figure 8.D.1 



If we create a new function by taking the average value of e x and e~ x for each value of 
x, we shall get the function which I have shown by the dashed line in the sketch. It is called 
y = cosh x. The reason for this is that it behaves in many ways like cos x, curious though this 
may seem at first sight. Its equation is given by 



y = cosh x = 



e x + e 



This function is even, that is, cosh(-x) = cosh x for any particular value of x. 

It describes the curve in which a heavy uniform chain hangs under its own weight. It also 
describes the sag in a metal tape measure when it is extended, and was used to correct for 
this before the invention of electronic measuring devices. 

e x + e~ x e x — e~ x 

If y = gives an interesting result, what about y = ? 

2 2 

We can think of this as finding the average value of e x and - e~ x for each value of x. 
This gives us the curve shown as a dashed line in Figure 8.D.2. 

This function is called sinh x and it is odd. That is, sinh x = - sinh(-x) for any particular 
value of x. 

We now have the pair of definitions 



cosh x = 



e x + e 



and sinh x 
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Figure 8.D.2 



Remembering from the rules for powers that 

e x X e x = e x + x = e 2x 

e~ x X e~ x = e~ x ~ x = e~ 2x 
and e x X e~ x = e x ~ x = e° = 1 , 
we have 

(e x + e- x \ 2 e 2x + e- 2x + 2 

cosh 2 x = (cosh x) 2 = = . 

\ 2 J 4 

^^^^ cosh 2 x is the way in which mathematicians write (cosh .v) 2 . It does not 
^H-^V mean cosh x 2 , which is more safely written as cosh(x 2 ). In fact 

2 _ 2 

e x + e~ x 

cosh(x 2 ) = . 

2 

It is just the same as cosh x except that the x is replaced by x 2 . 



We also have 

/ e *- e -x\2 e 2x + e -2x_ 2 

sinh 2 x = = 

\ 2 J 4 

So 



cosh 2 x - sinh 2 x = 1 . 



This is true whatever value we choose for x on the x-axis, so it is an example of an 
identity. I described some examples of identities in Section 2.D.(h). 
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Try showing for yourself that cosh 2 x - sinh 2 x = 1 , without looking at my working, to 
make sure you can do it. 

We begin to see now just why cosh x and sinh x have been named in this way. The above 
relationship is curiously like the trig identity of cos 2 x + sin 2 x = 1 . 

8.D.(b) Differentiating sinh x and cosh x 

We know that dldx (e x ) = e x and dldx (e~ x ) = - e~ x . 

What do we get if we differentiate (a) y = sinh x and (b) y = cosh x with respect to x? Have 
a go at doing this for yourself. 



This is what you should have. 

J/Jx(sinh x) = dldx (5 (e x - e~ x )) = \ {dldx (e x ) - dldx(e~ x )) 
= \ (e x + e~ x ) = cosh x 

and, similarly, 

dldx(cosh x) = sinh x. 

Again we see that sinh x and cosh x are behaving very similarly to sin x and cos x, though 
not quite identically since dldx(sin x) = cos x but d/dx(cos x) = - sin x. 

This seems very strange just now, because we have completely different graphs for these 
two pairs of functions. The mystery of this curious set of links becomes solved later on, in 
Section lO.C.(b). 

Also, just as we did with sin and cos, we can use the Chain Rule to differentiate slightly 
more complicated functions involving sinh and cosh. For example, 

d d 

— (sinh 3x) = 3 cosh 3x and — (cosh (x + 1)) = 2x sinh (x + 1). 

dx dx 

8.D.(c) Using sinh x and cosh x to get other hyperbolic functions 

Because of the similarities which we have already seen, it makes sense to define further 
hyperbolic functions to correspond to the other trig functions, so we say 



sinh x e x - e~ 

tanh x = 



cosh x e x + e x 

1 1 1 
cosech x = , sech x = , coth x 



sinh x cosh x tanh x 

Dividing cosh 2 x - sinh 2 x = 1 by cosh 2 x gives us 

1 - tanh 2 x = sech 2 x 
and dividing cosh 2 x - sinh 2 x = 1 by sinh 2 x gives us 

coth 2 x - 1 = cosech 2 x 
again similar but not identical results to the two trig rules of 

tan 2 x + 1 = sec 2 x and cot 2 x + 1 = cosec 2 x. 
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We can now use the Quotient Rule to find dldx (tanh x). Writing 
sinh x 



tanh x = 



we get 



cosh x 



(cosh x)(cosh x) - (sinh x)(sinh x) 1 
d/dx(tmh x) = ; = ; — = sech 2 x. 



cosh 2 x 



cosh 2 x 



(You can get this same result by working directly with tanh x written in terms of e x and e~ 
but this is longer. It was question (6) in Exercise 8.C.5.) 
Show for yourself that the following three rules are true. 





d 




(1) 


dx 


(sech x) = - sech x tanh x 




d 




(2) 


dx 


(cosech x) = - cosech x coth x 




d 




(3) 




(coth x) = - cosech 2 x 


dx 



(The working for these is very similar to the working for the corresponding trig functions 
which came in Exercise 8.C.5.) 



EXERCISE 8.D.1 



(1) If e x = 2, find the values of (a) sinh x, (b) cosh x and (c) tanh x by using their 
definitions in terms of e x and e~ x . 

(2) If x = 0, find the values of (a) sinh x and (b) cosh x. Check that your answers 
are believable by looking at the graph sketches of these two functions. What 
is tanh x when x = o? 



(3) Differentiate the following with respect to x. 



(a) y = cosh 2x 
(d) y = tanh 5X 



(b) y = sinh (3X + 5) 
(e) y = In (cosh x) 



(c) y = e 2x sinh 3* 
(f) y = cosh 2 3* 



8.D.(d) Comparing other hyperbolic and trig formulas - Osborn's Rule 

In this section, we look at whether some other rules which are true for trig functions are also 
true for hyperbolic functions. 

(1) In Section 5.D.(d), we showed that sin 2A = 2 sin A cos A. 
Is it true that sinh 2x = 2 sinh x cosh x? 

We look at the more complicated side first and see whether it will simplify to 
give the other side. Doing this gives us 

£ 2x _ e -2x 

2 sinh x cosh x = 2 I J I | = = sinh 2x, 

so this is another rule which transfers exactly. 
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(2) 



Investigate for yourself whether the trig rule of cos 2A = 
corresponding rule for hyperbolic functions of cosh 2x = 
Indeed, could this be so? 



cos 2 A - sin 2 A has the 
cosh 2 2x - sinh 2 2x. 



I hope you will have seen straight away that it couldn't be so since we know that 
cosh 2 x - sinh 2 x = 1 . Try finding for yourself what cosh 2 x + sinh 2 x is equal to. 



You should have 



8.D.(e) 



-x\2 



-x\2 



cosh 2 x + sinh 2 x = 



= \ {(e 2x + 2 + e- 2x ) + (e 2x + e~ 2x - 2)) 

= \ (e 2x + e~ 2x ) = cosh 2x. 

This time we have the two rules 

cos 2x = cos 2 x - sin 2 x and cosh 2x = cosh 2 x + sinh 2 x. 

The different results of (1) and (2) are examples of Osborn's Rule which says that the trig 
rules match the corresponding hyperbolic rules exactly, unless the working somewhere 
involves two sins or two sinhs multiplied together. In this case, there is a sign change there. 

Finding the inverse function for sinh x 

We look now at the function y = sinh x to see whether we can find a function that will take 
us back the other way. We'll start by considering a numerical example so that we can see 
what is happening here. 

Suppose we know that sinh x = 2. What value of x would give this result? 

I show this question pictorially in Figure 8.D.3. 




Figure 8.D.3 



We say that x = sinh 1 2 meaning that x is the number whose sinh is equal to 2. 



sinh 1 x does not mean 1/sinh x. This would be written as (sinh x) l . 
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Using a sequence like INV-HYP-SIN on your calculator should give you the answer of 
x = 1.44 to 2 d.p. but how can we show this process actually happening? We have 

e x — e~ x 

sinh x = = 2 so e x - e~ x = 4. 

2 

Multiplying through by e x gives e 2x - 1 = 4e x so e 2x - 4e x -1 = 0. 

This is actually a quadratic equation in e x , which we can see by putting e x = m. 
This gives us m 2 - Am - 1 = 0. We now use the formula to get 

4 ± Jl6 + 4 4 ± 2 J5 r- 

m = = = 2 ± J 5. 

2 2 

Now, e x = 2 - y[5 is not a possible solution, because e x is always positive. 
Therefore we have e x = 2 + yj~5 so x = In (2 + y5) = 1 .44 to 2 d.p. 

Having seen what happens with this particular example, we will now see how we can find 
a general rule for y = sinh -1 x. 

We use exactly the same method that we did with the numerical example. We start with 

e x -e- x 

y = sinh x = so 2y = e x - e x so e x - 2y - e x = 0. 

Multiplying through by e x gives 
e 2x _ 2y e * _ i = o. 

Again, this is a quadratic equation. We see this very nicely by putting m = e x . 
Then we have m 2 - 2y m - 1 = 0, and using the formula gives 

2y ± vV + 4 2y ± 2yjy 2 + 1 r - 

m = so m = = v± A /v z +l. 

2 2 

Replacing m by e x gives us 
e x = y ± + 1. 

Now, e x is always positive for every x which we can choose on the x-axis. However, 
y - yfy 2 + 1 is always negative since \Jy 2 + 1 > j. 

Therefore we cannot have e x = y - ^y 2 + 1. 

This gives us just the single possibility of = _y + ^/j 2 + 1. 

Taking natural logs of both sides of this equation, we get 

ln(e x ) = x = ln(y + v 'j 2 + 0- 

We now have the rule for finding the original x if we know what y is, but it is giving us x 
as a function ofy. We can see this from the direction of the arrows in Figure 8.D.4(a) which 
shows sinh x = 1 giving x = 0.88, and sinh x = 3 giving x = 1.82 to 2 d.p. 

We want a rule which will give us y as a function of x so we interchange x and j. 

This gives us the inverse function of 



y = sinh 1 x = ln(x + ^x 2 + 1). 



Try feeding in x = 1 and x = 3 to this, so that you can see it actually working. 
I show a sketch of this function in Figure 8.D.4(b). 
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Figure 8.D.4 

The interchanging of x and y means that, as for every function and its inverse, the graphs 
of y = sinh x and y = sinlr 1 x are symmetrical about the line y = x. 

If you draw your own sketch, showing both y = sinh x andj = sinh -1 x together, you can 
see this symmetry 

We can also see graphically in Figure 8.D.4(a) that y = sinh -1 x must be a function 
because there is only one value of x which can give a particular value of sinh x, so there will 
be no ambiguity when we want to go back the other way 

exercise 8.D.2 To extract as much information as possible from the two graphs above, and from 

Section 8.D.(b), try answering the following questions yourself. 

(1) What is the gradient of the curve y = sinh x at the origin? 

(2) From your answer to (i), what special property does the line y = x have? 

(3) From the symmetry of the two graphs, what is the gradient of the curve 
y = sinh -1 x at the origin? 

8.D.(f) Can we find an inverse function for cosh x? 

Again we start by looking at a numerical example. 

If cosh x = 2, what value of x could have given this result? 

We see immediately from Figure 8.D.5 that there will be two possible values of x. This 
is because cosh(x) = cosh(-x) for all values of x. 

Doing the working in exactly the same way as we did for sinh x = 2 in Section 8.D.(e), 
we find that e x = 2 ± v 3. (Do this for yourself.) 

Both these possibilities are positive so they are both possible solutions. 

If we take e x = 2 + ^3 we get x = In (2 + ^3) = 1.32 to 2 d.p. 

If we take e x = 2 - y[3 we get x = In (2 - ^3) = -1.32 to 2 d.p. 




Figure 8.D.5 
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Looking at the numbers in these two logs, it may seem surprising to you that they do give 
a matching pair of plus and minus answers. We shall see why this is so when we find a 
general rule for y = cosh -1 x. 

You will find that your calculator only gives you the answer of x = 1.32 to 2 d.p. for 
cosh -1 2. 

The reason for this is that, just as we saw with the inverse trig functions in Section 
5.A.(g), it is much more convenient to arrange things so that we have a single-valued answer 
and therefore a function. We can do this here by restricting ourselves to the right-hand side 
of the graph so that x > 0. We then get only one possible answer for x from each value of 
cosh x. 

Now we look for the general rule for cosh -1 x 

The procedure is very similar to what we did for sinh -1 x in the last section. 
See how far you can get by yourself. 



You should have 

e x + e~ x 

y = so 2y = e x + e x so e x - 2y + e x = 0 



so 



so 



e 2x - 2y e x + 1 = 0 so m 2 - 2y m + 1 = 0 putting m = e x 
2y ± vV - 4 2y ± 2^y 2 - 1 



m = 



= y± Vj 2 - 1 = e x . 



Both of these possibilities are positive, so we find that we are getting two possible solutions. 
We have 

e*=y± h 2 ~ 1 
so, taking natural logs, 

x = My ± vV 2 - 1). 

It is a nuisance having a general formula with this ± in the middle of the log where we can't 
easily get at it, so now we use a cunning trick involving the difference of two squares to put 
it somewhere better. 
It goes like this: 

0 + v> - i) 

(multiplying top and bottom by the same 
thing leaves the value unchanged) 

y 2 - (y 2 - 1) 1 



y + b 2 - i y + b 2 - i ' 

Why is this any better? 

It is because, if we have ln(l/a), this is the same as In 1 - In a, using the second rule of 
logs. These rules are listed in Section 3.C.(d). 
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But In 1 = 0 because e = 1. You can see that this agrees with Figure 8.D.I. So 
ln(l/a) = - In a and In f ) = - ln(y + Jy 2 - 1). 

\j + V/- 1/ 

This gives us the two solutions that x = ± ln(y + ^/j 2 - 1) and we see now why In (2 - y3) 
= - In (2 + ^/3) in the numerical example earlier. 

We now have the two possible values for x from a given y value. 

Interchanging x and y so that we can write this as a relation for y in terms of x, we 
have 

j = ± In (x + V* 2 - !))• 



If we restrict the x values by saying x > 0, we have the inverse function of 

y = cosh -1 x = In (x + ^/x 2 - 1)). 
This is called the principal inverse function for cosh. 




Figure 8.D.6 



I show the two functions, y = cosh x and y = cosh 1 x for x > 0 in Figure 8.D.6. Just as 
with any inverse pair of functions, they are symmetrical about the line y = x. 

8.D.(g) tanh x and its inverse function tanh" 1 x 

What will the graph of y = tanh x look like? It is not possible to get this one quite so simply 
from the graphs of y = e x and y = e~ x . We have 

sinh x e x - e~ x 

y = tanh x = = . 

cosh x e x + e~ x 

Try answering the following questions yourself. 

(1) What is tanh (0)? 

(2) Can you work out the connection between tanh (-x) and tanh (x)? 

What will this mean for the graph sketch? 

(3) Multiply the top and bottom of the fraction (e x - e~ x )/(e x + e~ x ) by e~ x . 

From your answer to this, can you see what happens to the values of tanh x 
when x takes very large positive values? 
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Now try multiplying the top and bottom of the original fraction by e x . Can you 
see what will happen to the value of tanh x when x takes large negative values? 

(You could check that your ideas are right by choosing some particular large 
positive and negative values for x and using your calculator.) 

(4) What is the gradient of the curve y = tanh x at the origin? 

(You may need to look at Section 8.D.(c) to answer this question.) 

(5) See if you can use all the information from your answers to the previous questions 
to draw a sketch of the graph for y = tanh x. 



You should have the following answers. 

(1) tanh (0) = 0 because sinh (0) = 0. 

(2) Replacing x by -x gives 



tanh (-x) = = - = - tanh x 

e -x _|_ g x \e x + e~ x I 

so the left-hand side of the graph will be given by reflecting the right-hand side of 
the graph in the j-axis and then turning it upside down, y = tanh x is an odd 
function, just like y = tan x. (We drew this in Section 5.A.(e).) 

(3) You should get 

e -x ^ g x _ g-x-j 2 _ e ~ 2x 

tanh x = = . 

e~ x (e x + e~ x ) 1 + e~ 2x 

The value of tanh x will become closer and closer to one as the value of x increases 
because e~ 2x becomes extremely small when x takes large positive values. 

Similarly, multiplying the top and bottom of the fraction by e x shows that tanh x 
gets closer and closer to -1 when x takes large negative values, since e 2x then 
becomes extremely small. 

(4) dldx (tanh x) = sech 2 x, so the gradient of y = tanh x when x = 0 is 1, because 
sech(0) = 1 . Also, since sech 2 x is positive, the gradient of y = tanh x is always 
positive. 

(5) Putting all this information together gives us the graph sketch shown in Figure 
8.D.7. 

The lines y = 1 and y = -1 are horizontal asymptotes for this graph. 
I have also drawn on the graph a line showing how we could find the value of 
x when tanh x = \. 
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If you use your calculator to find tanrr 1 (5), you will get x = 0.55 to 2 d.p. 

We can see from the shape of the graph that each value of tanh x can only come 
from one possible value of x, so therefore the function y = tanh x will have an 
inverse function. Now we'll find the rule that gives us this. We have 

e x — e~ x 

y = tanh x = — so y(e x + e~ x ) = e x - e~ x . 

e x + e~ x 

Multiplying all through by e x gives y(e 2x + 1) = e 2x - 1, so 

ye 2 * + y = e 2x - 1 and y + 1 = e 2x - ye 2x = e 2x (\ - y). 
Therefore 

e = . 

l-y 

Taking logs both sides, we have 

ln(e 2x ) = 2x = In so x = \ In 

\ \ -yj \ 1 -y, 

We now have the rule to get back to the original x if we know y. Use it to check 
that, if you put y = j, you do get x = 0.55 to 2 d.p. 

Interchanging x and y as before, so that we have this rule as a function of x, we 
get the inverse function of 



tanh 



-1 



In 



To give the log of a positive quantity, the possible values of x will have to lie 
between -1 and +1. 

We can see that this is where the values of x must lie from looking at the graph 
sketch of y = tanh -1 x which I have drawn with y = tanh x in Figure 8.D.8. 




tanh X 



Figure 8.D.8 
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I have used the line of symmetry y = x to draw this sketch. I have also used the 
answer to Question (4) which was that the gradient of y = tanh x when x = 0 is 1 . 
This means that y = x is a tangent to both y = tanh x and y = tanh -1 x. It is a very 
interesting tangent because it crosses both of the curves, which sort of flex 
themselves when x = 0. The line y = x does exactly the same thing with_y = sinh x 
and_y = sinh -1 x at the origin, as you'll see if you draw it in on Figure 8.D.4(a) and 
(b). We shall look at points of inflection like this in more detail in Section 
8.E.(b). 

You may find it helpful here to emphasise the separateness of the two curves by 
using two colours on them. Be careful to put the colour correctly on the two 
separate halves of each graph! (The tanh graph is a flattened S shape.) 

We were able to see from the graph that y = tanh x must have an inverse function, but 
suppose we didn't know what the graph looked like? Can we still show that the inverse 
relation will be a function? 

To do this, we have to show that it isn't possible to get the same value for tanh x from 
two different values for x, so that, when we go back the other way, there is only one possible 
answer. 

In other words, we have to show that the only way that tanh a = tanh b is for a and b to 
be themselves equal. 

We put tanh a = tanh b so 




and see what happens. Try tidying this up for yourself, and see if you can show that a and 
b must be equal. 



Multiplying by (e a + e a )(e h + e h ) to get rid of fractions, we get 

(e a - O (e h + e~ b ) = (e h - e~ b ) (e a + 

so 

e (a + b) _ e (b-a) + e (a-b) _ g -(a + b) = g (a + b) _ g (a-b) + g (b-a) _ g -(a + b) 

SO 

2e (a-b) = lg (b-a) SQ a _ b = b _ a so 2a = 2b and a = b. 
We've now shown that the inverse function does exist, without reference to the graph. 

^^^^ Remember that it is not true that e" X e b = e ab . We must add the powers. 

8.D.(h) What's in a name? Why 'hyperbolic' functions? 

The mystery of why sinh x and cosh x are called hyperbolic functions has not yet been 
explained. This section tells you why this is so. 

Suppose we let x = cosh 9 and y = sinh 9 and then plot the points that we get for different 
values of 9 on a graph. For example, if 9 = 0, we have x = cosh 9 = 1 and y = sinh 9 = 0, 
so one point on this graph will be (1,0). 
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Since cosh 2 9 - sinh 2 9 = 1, we know that the equation of this graph will be x 2 - y 2 = 1. 
This is the equation of the hyperbola which I show below in Figure 8.D.9. 




Figure 8.D.9 



This graph may look a more familiar shape if you turn it through 45° anticlockwise. The 
two dashed lines make this resemblance easier to see. 

Actually, only the right-hand side of it is given by x = cosh 9 andy = sinh 9. Can you see 
why this is? Can you think of a way that we could get the whole graph? 



cosh 9 can't be negative, and the points on the left-hand side of the graph have negative 
values for x. 

We could get the whole graph by putting x = sec 9 and y = tan 9. 

Since sec 2 9 - tan 2 9 = 1, we still have x 2 -y 2 = 1, and we have the left-hand side of the 
graph too, since sec 9 can take negative values. 

In a similar way, x = cos 9 and y = sin 9 are linked to the circle x 2 +y 2 = 1 . Indeed it was this 
circle which we used to define the sin and cos of angles greater than 90° in Section 5.A.(c). 

The variable 9 which we have used for this hyperbola and circle is called a parameter. 
We can get other curves of the same type by subtly adjusting how we use it. For example, 
x = 2 cosh 9 and y = 3 sinh 9 gives the hyperbola (x/2) 2 - (y/3) 2 = 1 and x = 5 cos 9 with 
y = 5 sin 9 gives x 2 + y 2 = 25, the circle with centre (0,0) and radius 5 units. 

Unbalancing them to give x = 4 cos 9 and y = 3 sin 9, say, gives a squashed circle, or 
ellipse, with the equation (x/4) 2 + (y/3) 2 = 1. This is centred at the origin and cuts the axes 
at (4, 0), (0, 3), (-4, 0) and (0, -3). 

There isn't space to go into this in more detail just now, but you will find that this use 
of parameters to describe particular curves is often of great practical use in extracting further 
information from relationships between physical quantities. 

Finally, you may be thinking that the name 'hyperbolic' isn't the only strange thing about 
these functions. Why is there this curious link between them and the trig functions? I'll show 
you the reason for this in Section 10.C.(b). 

8.D.(i) Differentiating inverse trig and hyperbolic functions 

This is something which students quite often find difficult, but if you have worked through 
the earlier parts of this section so that you are now happy with what these inverse functions 
do, you should find it quite straightforward. We'll look at two examples of differentiation, 
and then see how using the Chain Rule makes it possible to get lots of other similar results 
very easily. 
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example (1) How can we find dyldx if y = sinh 1 x? 

We could set about doing this in two ways. 

Method (i) Let y = sinh - 1 x. Then x = sinh y because this is what the inverse 
function of sinh - 1 means. Therefore dxldy = cosh y. 
Now we use the argument of Section 8.B.(e) to say 

dx 1 

dy dyldx 

excluding any values of x for which dyldx = 0. 

(It is also possible to do this by implicit differentiation. I show you 
this method in Section 8.F.(c).) 

Therefore 

dy 1 
dx cosh y 
But cosh 2 y = sinh 2 y + 1 , so 

cosh 2 y = x 2 + 1 and cosh y = ± ^x 2 + 1. 

But we know that the gradient of y = sinh x is always positive. (How 
do we know this? What is dldx (sinh x)?) 



It is cosh x and cosh x is always positive. Therefore 
dy_ = 1 
dx V* 2 + 1 
and we have the result that 



d _ 1 
(sinh 1 x) = — 



dx" V* 2 + 1 



Method (2) This uses the result which we found in Section 8.D.(e) that 
sinh - 1 x = In (x + ^x 2 + 1). 
Therefore we can say 



! 'c 2 + 1)" 1/2 

dx dx x + ^x 2 + 1 



d , d ^ 1 + 5 (x 2 + 1) -1/2 (2x) 

— (sinh - 1 x) = — (In (x + ^x 2 + 1)) = 



This doesn't look too good but it is tidied up amazingly by multiplying 
the top and bottom by ^jx 2 + 1 . We then get 

\/x 2 + 1 + x 1 

. neat! 



Vx 2 + 1 (x + v* 2 + 1) V* 2 + 1 

example (2) This time, we differentiate an inverse trig function. 

We will find dyldx if y = tan -1 x (or arctan x as it is also known). 
Remember that y = tan -1 x means that y is the angle between -ti/2 
and jt/2 whose tan is x. I explained this in Section 5.A.(i). 
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We start by saying that 
dx 

x = tan y so — = sec 2 y. 
dy 

Then we use the identity tan 2 y + 1 = sec 2 y to get sec 2 y = x 2 + 1 , so 

dx dy 1 

= x 2 + 1 and 



dx x 2 + 1 
giving us the result 



1 

(tan" 1 x) 



dx x + 1 



example (3) This example shows how we can apply the above result. 
Suppose we need to find 

d 

— (tan- 1 (2x + 3)). 
dx 

We don't need to do all the previous working again because 2x + 3 is 
itself a function of x. Therefore we can just use the Chain Rule, putting 
X= 2x + 3, and remembering that dyldx = (dy/dX) (dX/dx). (See 
Section 8.C.(a) if necessary.) 
Here, we have 

y = tan" 1 (2x + 3) = tan" 1 X and X = 2x + 3 

so 

dy 1 dX 

— = and — = 2. 

dX X 2 + 1 

Therefore 

dy 2 2 2 



dx X 2 +l (2x + 3) 2 + 1 4x 2 + 12x + 10 

In general, we can say that if y = tan" 1 (lump), and the lump is a 
function of x, then 

d I 1 \ d 

— (tan- 1 (lump)) = — — X — (lump). 

dx \ (lump) + 1 / dx 



If you think of it this way, you will probably be able to write the answers down straight 
away. 

We get a particularly useful version of this if we put (lump) = xla where a is a constant. 
This gives us 

d I x\ 1 1 a 2 1 a 
tan" 1 - = X - = X 



dx \ a ) x 2 la 2 +1 a x 2 + a 2 a x 2 + a 2 
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This result is very useful for finding some particular kinds of integral, as we shall see in 
Section 9.B.(d). 

Exactly the same system can be used to differentiate inverse functions of other more 
complicated functions. 

So, for example, if we have sinlr 1 (lump), then 



d 1 d 

— (sinlr 1 (lump)) = - X — 

dx v (lump) 2 + 1 dx 



(lump). 



In particular, if (lump) = x/a, we have 

/ x\ 1 
— sinh — 

dx \ a 



1 



1 



1 



X 



\x 2 la 2 + 1 a ^x 2 + a 2 a ^x 2 + a 2 

I have tidied up the first fraction by multiplying it top and bottom by a, remembering that 
a put inside a square root must be written a 2 . 

(1) By choosing suitable values for a, and using the pair of results 



^- (tan" 1 (x/a)) = ° - and (sinh" 1 (x/a)) = , 
dx x 2 + a 2 dx y* + 0 



differentiate the following with respect to x. 

(a) tan- 1 ^-j (b) sinh" 1 ^-j (c) tan" 1 ^— 



(d) sinh" 1 



IX 



(2) Use the Chain Rule to differentiate the following with respect to x. 

(a) tan" 1 (sx) (b) sinh" 1 (3X) (c) tan" 1 (x + 3) (d) tan -1 (3X + 4) 
(e) tan" 1 (1 - x) (f) sinh" 1 (ix + 1) (g) sinh" 1 (3 - 2x) 



(h) tan" 1 



x + 3 



(i) tan" 1 



2X + 1 



0) tan" 1 



3* + 2 
1 + x 



d 1 

(3) Show that — (tanh -1 x) = using tanh -1 x = \ In 

dx 1-x 2 1 - x 

(4) Solve the equation 8 sinh x = 3 sech x. 

(5) Find all the possible solutions of the following equations. 

(a) 2 sinh 2 x - 5 cosh x - 1 = o (b) 3 sech 2 x + 8 tanh x - 7 = o 

8.E Some uses for differentiation 

8.E.(a) Finding the equations of tangents to particular curves 

In Section 8.C.(e), we found the gradient of two of the tangents to the curve 

y = (x + 3)/(x - 2) 

by using the Quotient Rule to find dyldx for this curve. 

Since dyldx tells us the steepness or gradient of a curve at any given point on it, it makes 
it possible for us to find the equation of the tangent to the curve at any point on it, provided 
that this is a point where the curve has a tangent, and none of the problems of Section 8.A.(f) 
exist. 
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Here are two examples of doing this. 



example (1) Find the equations of the tangents to the curve y = x 2 - Ax + 3 at the 
two points (a) (5,8) and (b) (2, -1). 

To find the gradients of the tangents we differentiate y = x 2 - Ax + 3 with respect to x 
giving dyldx = 2x - A. 



This gives us the rule to find the gradient of the tangent for any value of x. 
It is not the equation of the tangent. 



(a) When x = 5, dyldx = 10 - 4 = 6 so m = 6 for the tangent at (5,8). 

Using y -y\ = m(x -Xj) for the equation of the tangent, from Section 2.B.(f), 
we have y - 8 = 6(x - 5), so y = 6x - 22 is the equation of tangent (a). 

(b) When x = 2, dyldx = 0. 

What is happening here? Try drawing your own sketch to show how this makes 
sense. 



If dyldx = 0, the tangent is horizontal. This tangent is at the lowest point of the curve 
y = x 2 - Ax + 3, and its equation is y = -1. 

I show a sketch of the curve and these two tangents in Figure 8.E.I. 




Figure 8.E.1 



example (2) Find the equations of the tangents to the curve y = cos x when 
(a) x = nil, (b) x = tt/6 and (c) x = it. 

If y = cos x then dyldx = - sin x so the gradient of tangent (a) 
is - sin jtI2 = - 1 . It touches the curve y = cos x at the point 
(pt/2, 0) and its equation is y = -l(x - Jt/2) or y + x = ji/2. 

The gradient of tangent (b) is - sin jz/6 = -5. 

(We found the sin, cos and tan of jt/6, Jt/4, and jr/3 (that is, 30°, 45° 
and 60°), in Section 4.A.(g).) 

Tangent (b) touches the curve y = cos x at (jt/6, V 3/2) and its 
equation is y - yf3/2 = —j (x - nl6) or 2y + x = jt/6 + ^[3 
or y = -\x+{nl\2 + V3/2). 
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This looks a little unfriendly, but it is not surprising that the equation of 
a tangent to a cos curve should involve numbers like n and yj3. The value 
of jr/12 + yf3/2 is 1.13 to 2 d.p. and this agrees with the look of the y 
intercept of tangent (b) on the graph sketch which I have drawn below. 

The gradient of tangent (c) is - sin it = 0 so this tangent is 
horizontal. Its equation is y = -1. 

All three tangents are shown here in Figure 8.E.2. 




\ 



Figure 8.E.2 

exercise 8.E.1 Find the equations of the tangents to the curves 

(1) y = e x at (a) x = o (b) x = l and (c) x = 2. 

(2) y = tan x at (a) x = o and (b) x = jr/4. 

Draw sketches in each case to show these tangents. 
Use one of the results which you have found in (1) to decide how many 
solutions there are to the equations (i) x = e x and (ii) 3X = e x . 

(3) There is something special about one of the tangents in Example (2) above 
and one of the tangents to the curve y = tan x in question (2) in this exercise. 
Can you spot what this special property is? There were examples of tangents 
with this same property in the previous section, too. 

8.E.(b) Finding turning points and points of inflection 

A turning point on a curve with the equation y = f(x) is a point at which dyldx = 0, or 
f'(x) = 0, writing the same thing in function notation. Turning points are also sometimes called 
stationary points, and the values of f(x) where fix) = 0 are called stationary values. 

From the examples which we have just looked at in the previous section we can see that 
it will be useful when sketching curves if we can find where the horizontal tangents are, that 
is the points where dyldx = 0. Finding the answer to this will not only help us to draw graph 
sketches, but also to extract useful information about physical relationships. (For example, 
in Section 2.D.(g), the horizontal tangent is at the point of the curve corresponding to the 
highest point reached by the ball, so dsldt = 0 at this point.) 

Sometimes it is also helpful to know what the value of d 2 yldx 2 is for particular values of 
x. d 2 y/dx 2 means dldx (dyldx), so it tells us the rate of change of the rate of change with 
respect to x. We used it, but with a different letter, in Section 8A.(e) when we found d 2 xldt 2 
for x = cos t. 
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To help you to understand the different possibilities, I have drawn sketches showing 
interesting points on the curves of some simple functions in Figure 8.E.3. You should fill in 
your own answers to the questions I have asked you in the table below the drawings. 




In aJl tfcese sketches O masks both tF*. origin and zfcro. 

Figure 8.E.3 



Function 


dx 


dy 

Value of — 

dx 


dy 2 
d? 


dy 2 

Is — +, - or 0? 

dx 


(a) y = x 2 at 0 


2x 


0 


2 


+ 


(b) y = x 3 at 0 










(c) y = -x 2 at 0 










(d) y = tan x at 0 










(e) y = cos x 

(i) at ,4 

(ii) at B 

(iii) at C 

(iv) atD 










(f) 7 = x 4 at 0 











8.E Some uses for differentiation 



337 



Now check your answers to this table. These are given at the back of the book as Table 
8.E.2 after the answers to Exercise 8.E.I. 

Next, go back to the curves in Figure 8.E.3 and look at what is happening to the steepness 
of the curve either side of the marked point in each case, and try answering the following 
questions. 

(1) Is the slope positive or negative? 

(2) Does this sign change as you move through the marked point? 

(3) Is the steepness increasing or decreasing as you move through the marked 
point? 

(4) What happens to the sense of turn of the curve either side of the marked point? 
(I have shown this with curved arrows.) 

You may find that it helps you to think about what is happening here if you sketch in 
some of your own tangents to the curves in my diagrams. (I'd suggest using pencil for this, 
then you can do it more experimentally.) 

It's important for your understanding here that you do try to answer these questions 
yourself. Don't just skip to the next bit to get them answered for you. 



Now, we'll look together at what the answers to the four questions above tell us. 
We find that the points marked with letters (including the various points at the origin, 
marked O in each diagram) fit into three different categories. 
These are as follows: 

(1) At O in diagrams (a) and (f), and at C in (e), we have what is called a local 
minimum ('local' because sometimes curves may dip down below this value 
somewhere else). At these points, the value of dyldx is zero because the tangent to 
the curve is horizontal. As we pass through these points, the slope of the tangents 
changes from negative to positive as the value of x increases. The sense of turn 
remains anticlockwise through these points. 

(2) At O in diagram (c), and at A in (e) we have what is called a local maximum. 
Again, the value of dyldx is zero at these points. As we pass through these points, 
the slope of the tangents changes from positive to negative as the value of x 
increases. The sense of turn remains clockwise through these points. 

(1) and (2) give the result that 



dy 

— = 0 at any local maximum or minimum. 

dx 



(3) At O in diagrams (b) and (d), and at B and D in diagram (e), we have points where 
the curve flexes itself. These are called points of inflection. The tangent to each 
curve at these points crosses the curve there, and the sense of turn changes. At O 
in (b) and (d), and at B in (e), it changes from clockwise to anticlockwise, and at 
D in (e) it changes from anticlockwise to clockwise. 
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O in (b) is the only one of these points where we also have dyldx = 0. 

Either side of each of these points the slope of the tangents remains either 
positive or negative. In the first three cases, the slopes of the tangents first become 
flatter as we approach the point and then steeper again once we are through it. This 
means that the slope itself has a local minimum at the point concerned. In other 
words, dldx (dyldx) = d 2 y/dx 2 = 0 at each of these points. 

In the fourth case, of curve (e) at D, the slope becomes steeper as we approach 
D, and then less steep once we have passed D, so this slope has a local maximum 
at D. Again, d 2 y/dx 2 = 0. 

If you find d 2 yldx 2 for the other examples we have met of tangents crossing 
curves, you'll see that it is also zero at these points. (You could check for yourself 
with y = sin x, y = sinh x and y = tanh x, all at the origin.) 



d 2 y 

= 0 at any point of inflection 

dx 



We have seen that d 2 y/dx 2 = 0 at any point of inflection. 
What will happen to the value of d 2 yldx 2 at a local maximum or minimum? 
At each of the local maximum points from (c) and (e), the slope of the curve goes from 
positive to negative, so the change in the slope is negative. 
In both cases, d 2 yldx 2 is negative at the maximum point. 

At the two local minimum points of (a) and (e), the slope of the curve goes from negative 
to positive, so the change in the slope is positive. 

In both cases, d 2 yldx 2 is positive at the minimum point. 

The case of the local minimum in (f) works out a little differently. The slope of the 
curve goes from negative to positive, and its rate of change is positive except at the point 
O itself. 

We have d 2 yldx 2 = 12x 2 = 0 at the point O, although it is positive either side of O. 
At O itself, the curve is very blunt because it has its four roots of x = 0 all bunched 
together here. This has the effect of making the rate of change of dyldx at this point (that 
is, d 2 yldx 2 ) equal to zero. This effect, which will happen whenever a curve is blunt like 
this, makes the rules for testing for maximum and minimum points slightly more 
complicated, because it is only sometimes possible to use the sign of d 2 yldx 2 to test which 
we've got. 
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Here is a summary of the above results, so that we can use them to find out how particular 
curves will behave. 



Finding and classifying turning points and points of inflection 



For a point to be a local maximum, dyldx must be equal to zero. Then use either 

Test (1): the gradients of the tangents move through the point in the sequence 
+ 0 -, so test the value of dyldx either side of this point, 

or 

Test (2): if the value of d 2 y/dx 2 is negative at this point, then it is a local 
maximum, but if d 2 y/dx 2 = 0 then Test (1) must be used. 



For a point to be a local minimum, dyldx must be equal to zero. Then use either 

Test (1): the gradients of the tangents move through the point in the sequence 
- 0 +, so test the value of dyldx either side of this point, 

or 

Test (2): if the value of d 2 yldx 2 is positive at this point, then it is a local 
minimum, but if d 2 yldx 2 = 0 then Test (1) must be used. 



For a point of inflection, 

(1) the value of dyldx does not change sign as it moves through the point (it may 
or may not be equal to zero at the point itself), 

and 

(2) the value of d 2 yldx 2 at the point must be equal to zero. 



8.E.(c) General rules for sketching curves 

The tests outlined in this previous section give us useful extra information which we can use 
for sketching graphs. 

I have already listed informally the other questions which we need to answer in order to 
draw a graph sketch in Section 3.B.(i) where we sketched y = (x + 3)/(x - 2). You should look 
back at how we built up this sketch before going on. 

Now that we can include finding the turning points, I can give you a complete summary 
of the questions which you need to answer in order to sketch a curve. 

For convenience, I will call this curve y = f(x) but, of course, other letters can be 
used. 
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Questions to answer in order to draw a graph sketch 

(1) Does the curve cut the j-axis? If so, where? (Try putting x = 0.) 

(2) Does the curve cut the x-axis? If so where? 

(This is the same as asking if the equation f(x) = 0 has any roots on the 
x-axis.) 

(3) Are there any values of x which have to be excluded because they would mean 
trying to divide by zero? 

If so, what are they? (Such values of x will give you vertical asymptotes. 
An asymptote is a line which the curve of the graph of the function becomes 
closer and closer to.) 

What happens to the values of f(x) for values of x just either side of the 
forbidden values? 

(4) What happens to the values of f(x) when x takes very large positive or 
negative values? 

(If it gets closer and closer to some fixed limit then this will give you a 
horizontal asymptote.) 

(5) Are there any turning points? (That is, are there any values of x for which 
f'(x) or dyldx = 0?) If so, what are they? 

You will need to find the value of f(x) (the stationary value) for each of 
these values of x. 

Test each turning point to find whether it is a local maximum, local minimum 
or point of inflection. (The tests for this are at the end of the previous section.) 
You don't usually need to find points of inflection where dyldx * 0 unless you 
are specifically asked to do so. 



An example to show these tests in action 

We'll draw a sketch of 

x - 5 

y=f(x) = 



x 2 -9 

so we go through answering each of the questions in the list above in turn. 

(1) Puttingx = 0 gives f(0) = f so the curve y =f(x) cuts thej-axis at the point (0, §). 

(2) f{x) = 0 ifx - 5 = 0 so ifx = 5. 

The curve y =/(x) cuts the x-axis at (5,0). 

(3) Any value of x which makes x 2 - 9 = 0 must be excluded. 

x 2 - 9 = (x + 3) (x - 3) so we can't have x = -3 or x = 3. 
The lines x = -3 and x = 3 are vertical asympotes of y =/(x). 
Testing with nearby values of x, using a calculator, gives: 
The value of f(x) is large and negative if x is just less than -3. 
The value of f(x) is large and positive if x is just greater than -3. 
The value of f(x) is large and positive if x is just less than +3. 
The value of fix) is large and negative if x is just greater than +3. 
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(4) The easiest way to see what will happen to y =/(x) = (x - 5)/(x 2 - 9) if x takes very 
large positive values, is to divide the top and bottom of the fraction by x 2 . This 
gives us 

x - 5 1/x - 5/x 2 

y x 2 -9 1 - 9/x 2 ' 

Now, as x becomes very large, each of 1/x, -5/x 2 and -9/x 2 becomes very 
small. 

We can say that, as x — > °°, each of 1/x, -5/x 2 , and -9/x 2 — > 0. 
So we will have y — > \ or 0 as x — > °°. 

Exactly the same thing happens for large negative values of x, so the line y = 0 
(which is the x-axis - be careful here!) is also an asymptote. 

Check with some large values of x on your calculator that the value of y really 
is getting close to zero. 

You could also look at what is happening entirely experimentally by using your 
calculator, but you might then be left with a sneaky feeling that perhaps the curve 
does some strange unforeseen wiggle which your calculator hasn't revealed. 
Remember that you can't ever prove what a curve will do by testing with numerical 
values, but you can certainly prove that it won 't do something. It is always wise to 
check your ideas of what it does do. 

A mistake which students quite often make when graph-sketching is to work 
out exact values for some very boring bit of the curve which is almost a straight 
line. Then they think that the whole thing is probably a straight line, so getting a 
total disaster. The method I have given you here shows you how to find all the 
interesting bits. 

(5) Differentiating y =f(x), using the Quotient Rule, we get 

dy ( (x 2 -9)(l)-(x-5)(2x) 10x-x 2 -9 
dx (x 2 -9) 2 (x 2 -9) 2 

dy „ 
so — = 0 if lOx - x 2 - 9 = 0 or x 2 - lOx + 9 = 0. 

dx 

Factorising gives (x - 1) (x - 9) = 0 so x = 1 or x = 9 for the stationary values. 
(You could also find these by using the quadratic formula to solve the 

1 4 ' 

equation.) The two stationary values are /(l) = 5 and f(9) = 75 = jg, so the 
turning points are (1, 5) and (9, jg). 

^^^^ Remember that these turning points are points on the original curve, so that to 
^H-^V find them you must substitute the two values of x which give them into the 
equation of the original curve. 



Now we want to know whether there are local maximum or minimum points on the 
curve. 

Finding d 2 yldx 2 is not a pleasant prospect here, so we look at the values of dy/dx or 
f'(x) either side of x = 1 and x = 9. 
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Passing through x= 1, the sequence goes - 0 + giving a local minimum of/(l) = 5. You 
can show this here by choosing, say, x = 0 and x = 2 and substituting these values into the 
expression which we have found for dyldx. These particular values give -9 and 25, 
confirming the sequence of - 0 +. 

Similarly, passing through x = 9, the sequence goes + 0 - giving a local maximum 
of/(9) = i 

Notice that the value of the local minimum is actually greater than the value of the local 
maximum for this curve. 

We now have all the information we need to draw the graph sketch. I show this in Figure 
8.E.4. 




Figure 8.E.4 



exercise 8.E.2 Now try sketching the graphs of the following functions yourself. 

Students often find graph-sketching difficult, but if you answer all the 
questions in my list for each curve, you should find that you can draw the 
sketches successfully. You will also understand why the curve behaves as it does, 
which won't be the case if you just use a graph-sketching calculator. 

X — 1 X 

(a) y = f{x) = — (b) y = g(x) = (c) y = h(x) = x + 4/x 

x 2 - 4 1 + x 2 

9 

(d) y = p(x) = x - - (e) y = fix) = x 2 e" 
x 



8.E.(d) Some practical uses of turning points 

Being able to find the turning points of a function can have much wider implications than 
just making it easier to sketch its graph. In particular, it gives us a method of answering 
many practical questions. 

Since most of the examples we shall look at together in this section involve the 
volumes and surface areas of solid shapes, I am putting in a table here to give some of 
these. 
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A summary of volumes and surface areas of the commonest solids 

The four solids are shown in Figure 8.E.5. 




Figure 8.E.5 



In each formula, V stands for volume and A stands for surface area. 

(1) For a closed rectangular box, V = Ibh and A = lib + 2bh + 2lh. 

(2) For a closed cylinder, V = Jir z h and A = 2nr 2 + 2nrh. 

(3) For a cone, including its base, V = \jir 2 h and A = icrl + Jtr 2 . 

(4) For a sphere, V = jTtr and A = Anr 



#A volume must always involve three lengths multiplied together. A surface 
area must always involve two lengths multiplied together. If you find that 
you have an equation for which this isn't true, go back and rechecM 
Something has gone wrong somewhere. 



example (l) This is typical of the sort of problem which we can now solve. It comes 
in two parts. 

(a) A manufacturer wishes to construct a metal can to hold a given 
volume of liquid. If the can is made entirely of the same thickness 
of metal, what is the best ratio of the height of the can to its radius 
so that the least amount of metal is used? 

(b) To make the construction more rigid it is decided that it will be 
necessary to use a double thickness of metal for the top and bottom 
of the can. In order to keep the cost of production to a minimum, 
what dimensions should the can now have? Give the answer again in 
the form of the best ratio of its height to its radius. 

(a) We start by drawing a sketch of the can (which I have done in 
Figure 8.E.6, giving it a height of h and a radius of r). 

Next, we label the other quantities we shall need to deal with. 

Let the volume be V, the area be A, and the ratio of hlr be x. 
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The curved surface 
un rolled 

2irr 



Since the can is being made to hold a given quantity of liquid, we 
know that V is a fixed quantity. We have V = nr 2 h, and hlr = x, so 

V 

V '= icr 3 x and x = . 

Jtr 3 

The surface area, A, is made up of the two circular ends of the can 
and its curved surface, which would unroll to give a rectangle. 
This gives us A = 2nr 2 + 2nrh. 

At present, we only know how to differentiate functions with one 
variable, but the expression which we have for A involves the two 
variables, r and h. 

However, we know that the fixed quantity V = nr 2 h, so h = Vljir 2 . 

Substituting this for h gives us 

A = 2nr 2 + 2nr I — — I = 2nr 2 + —. 



nr 2 1 r 



We've now got A described entirely in terms of the one variable, r. 
Since we want a minimum value oi A, what should we do next? 



We should find dAldr, and look for values of r which make it equal 
to zero. We get: 

dA 2V V 

— = Anr - — = 0 if Ttr 3 = — 
dr r 2 2 

so the ratio 

h V 

- = x = = 2. 

r VI2 

(Remember when you differentiate that both ji and V are constants.) 
Now we check for certain that this gives a minimum value for .4. We 

get 

d 2 A 4V 
= 4ji 



dr 2 r 3 

which is positive since the value for r which we have found is positive. 
Therefore, we have found the ratio which gives a minimum value for A. 

We have found that the surface area is smallest when the radius of 
the cylinder is half its height. This means that the vertical cross-section 
through the central axis will be a square. 
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(b) Now that the two ends of the can are to be made from a double 
thickness of metal, it seems likely that we should make the can taller 
and thinner in order to minimise the amount of metal we use. We will 
assume that a double thickness costs twice as much, and take the cost 
per unit area of the curved sides of the can to be c. Then the metal in 
the two ends will cost 2c per unit area, and we will call the total cost of 
the can C. 

V and x will have the same equations as before but we will now have 

/ , 2V\ 
C = Anr 2 c + Inrhc = 4jrr 2 + — c 

so 

dC ( 2V\ V 
— = $7ir - — c = 0 if Jir i = — so x = 4. 
dr \ r 2 J 4 

In the same way as before, this gives a minimum for the cost, so we 
have found that the height should now be four times the radius. 

We can see how this pair of answers might work out numerically by 
taking the particular case of a half-litre can. This makes V = 500 cubic 
centimetres. 

Then, in case (a) where h = 2r we have 500 = 2jtr 3 so r = 4.30 cm 
to 2 d.p. and h = 8.60 cm to 2 d.p. 

In case (b) where h = 4r we have 500 = 4jrr 3 so r = 3.41 cm to 2 
d.p. and h = 13.66 cm to 2 d.p. 

example (2) What is the volume of the largest cylinder which can be placed inside a 
cone of fixed height H and radius R so that it just touches it inside, as I 
show in Figure 8.E.7(a)? Is it possible to fill in more than half the space 
inside this cone with such a cylinder? 

We can see that the possible shape of the cylinder can vary between 
a sort of thin pencil to a flat biscuit. The largest possible size will occur 
somewhere between these two extremes. 

I will call the height of the cylinder h and its radius r. 

Then its volume V is given by V = nr 2 h and we have to find the 
largest possible value of V (which will be in terms of R and H, the 
radius and height of the cone), as r and h vary. 
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Since, at present, we can only differentiate functions with one 
variable, we must somehow use the physical relationship of the cone to 
the cylinder to find h in terms of r. To see how we can do this, we take 
a vertical cross-section along the joint axis of the cone and cylinder 
which gives us Figure 8.E.7(b). 

We can now use the two similar triangles, ABC and ADE. These 
triangles nest into each other, so their sides are in the same proportion. 
Therefore 

BC AB r H-h 

= so — = 

DE AD R H 

so rH = RH - Rh and Rh = RH - rH = H(R - r). 
Therefore 

H(R - r) 

h = . 

R 

Substituting this for h in the equation V = Jir z h we get 

nr 2 H(R - r) JtHr 3 

V- = JtHr 2 . 

R R 

We can now find dVldr (remembering that ji, H and R are all 
constants). We get 

dV 3jiHr 2 I 3r 

— = 2nHr = JtHr 2 - — 

dr R \ R 

To find the maximum V, we put dVldr = 0. Now 

/ 3r\ 3r 2R 

JtHr 2 - — = 0 if r = 0 or — = 2 so r = — . 
\ R) R 3 

We can see physically that r = 0 gives us the minimum value of zero for V. 

Also, d 2 VI dr 2 is negative if r = 2RI2> . (Check this for yourself.) 
Therefore, this value of r gives us the maximum volume. 
How high will this cylinder be? We have 
H(R -r) H(R- 2R/3) H 
R R 3 

so it is one third of the height of the cone. 
The volume of this cylinder is 



71 



(2RV(H\ AtiR 2 H 



\ 3 / \ 3 / 27 
The volume of the cone is fiR H so the proportion of it which is filled by 

4 1 4 

this largest possible cylinder is (57) -h (5) = 5, that is, less than half of it. 



exercise 8.E.3 Try these for yourself. 



(1) What is the maximum volume of a square-based open box made by cutting 
squares from the corners of a square piece of cardboard with sides 10 cm long, 
and then bending up the sides. I'm assuming here that the sides will then be 
taped together - you don't have to make allowances for overlap. 
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(2) What are the dimensions of the largest cylinder which can be placed inside a 
sphere of fixed radius R so that its two ends just touch the sphere? Is it 
possible to fill more than half of the interior of the sphere this way? 

(3) What is the maximum distance from the origin of a particle moving on the x-axis 
so that its distance from 0 is given by the equation x = 3 cos t + 4 sin r? 

Before rushing into differentiating here, have a think about how else you 
could write 3 cos f + 4 sin f. (Look back at Section 5.D.(f) if necessary.) 

8.E.(e) A clever use for tangents - the Newton-Raphson Rule 

This is an ingenious application of the properties of tangents which makes it possible to find 
closer and closer approximations to the roots of equations which are too difficult to solve 
exactly. (In the United States, the credit for this method is usually given entirely to Isaac 
Newton and it is called Newton's method.) 
First, I'll explain graphically how it works. 

Suppose you have some equation f(x) = 0 which you want to solve, so you want to find 
as accurately as possible the point where the curve y =f(x) crosses the x-axis. It may, of 
course, do this more than once, but we will look at just one crossing point, where x = a, 
say. 

In order to start the Newton-Raphson process, we need to have some idea of where a is. 
Suppose that by some ingenious method we have been able to find that x = x 1 is a value close 
to a. Figure 8.E.8(a) shows the curve of f(x) near a and x x . 




Figure 8.E.8 



Then, if the curve really looks like my drawing, the tangent to the curve at x = Xj will cut 
the x-axis at a point x 2 which is closer to the true root a than x x was. 

How can we find out what x 2 is, from knowing what y =f(x) and x x are? 

The point P has coordinates (x x , y{), or (x l5 f{x{j), so we do know some 
measurements. 

From Figure 8. E. 8(b) we can say that the gradient of the tangent at P is f(x-i)/(x 1 - x 2 ). 
But the gradient of any tangent is also given by dyldx or f'(x) at the point concerned. This 
means that we know that the gradient of this particular tangent is f'(x x ). (Using/' here 
instead of dyldx is very handy as it makes it easier for us to talk about particular gradients.) 
We can now say that 

fix,) 

X\ — X2 
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Next, we need to rearrange this to give us a rule for finding x 2 . We get 

fa) 

(xj - x 2 )f(xi) =f(x 1 ) so xi-x 2 = ——■ 
This gives us 



The Newton-Raphson Rule 




(One of my students gave me a handy way of remembering which way round the last bit 
goes. She said 'dashed goes down'.) 

If x l is close to the root x = a, and if the curve is not too wiggly or behaving in other 
unexpected ways, then x 2 will be closer to a than x x was. (Sorting out these 'ifs' is what the 
subject of mathematical analysis does. It makes it possible to get results like this by 
analysing just what properties the curve must have near x = a for the method to work. For 
example, we certainly won't want any of the complications described in Section 8.A.(f).) 

Having found x 2 , we can then repeat the process to get an even better approximation of 
x 3 to a, and so on, until we have as many decimal places of accuracy as we require. 

Next, we will look at how this process works taking some particular examples. 

example (1) For my first example, I will take an equation that we can solve exactly, 
so that you will be able to see how this process actually gives the right 
answer. 



Suppose f{x) = x 3 + x 2 - 9x - 9 = (x - 3) (x + 1) (x + 3) so/'(x) = 
3x 2 + 2x - 9. 

I show a picture of f(x) in Figure 8.E.9. 

We can see from the factorisation that one of the roots of f(x) = 0 is 
x = 3. 

We'll take x = 4 as a starting value, and see if the Newton-Raphson 
process takes us towards the true root of x = 3. We have 



4 - 



7(4) 



= 4 = 3.26 to 2 d.p. 



%2 — -^i — 



/'(4) 
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>3C 



Figure 8.E.9 
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/(3.26) 

x 3 = 3.26 = 3.024 to 3 d.p. Check this for yourself. 

/'(3.26) 

/(3.024) 

x 4 = 3.024 = 3.000 to 3 d.p. Check this one, too. 

/'(3.024) 

In this particular example, the process is working beautifully, and you 
can see the successive answers homing in on x = 3. 

If we hadn't known where the roots were, we could have used the 
changes in sign of f(x) to show us where to look. 

Working out some values gives us /(4) = 35, f{2) = - 15, /(0) = - 9, 
/(-2) = 5and/(-4) = -21. 

Looking at the picture of Figure 8.E.9, you can see the sign changing 
either side of each root, as the curve crosses the x-axis. 

Have we made a brilliant discovery here? 

Will we always be able to use this system to find an interval in 
which a root must lie? Try deciding for yourself whether the following 
two statements are true. 

Statement (i) If fix) = 5x 3 + 6x 2 - 23x + 12 then f(0) = 12 and f(2) = 30. 
Therefore there is no root between x = 0 and x = 2. 
(x + 3) 

Statement (2) If f(x) = then f{\) = - 4 and/(3) = 6. 

(x-2) 

Therefore f(x) has a root between x = 1 and x = 3. 

If you don't agree with these statements, see if you can work out what is really 
happening. 

Everything you need to be able to do this has already come in this book. 



We can see what is really happening in the first case by using the methods of Section 
2.E.(a). 

We have f(x) = 5x 3 + 6x 2 - 23x + 12, and f{\) = 0, so immediately we know that 
statement (1) is false. 

What is actually going on? 

Since f{\) = 0, we know that (x - 1) is a factor of fix). 
Matching up the end terms gives us 

5x 3 + 6x 2 - 23x + 12 = (x - 1) (5x 2 + px - 12). 

Matching the terms in x 2 gives us 

6x 2 = -5x 2 + px 2 so p = 1 1. 

Now we have 

f(x) = (x - 1) (5x 2 + 1 lx - 12) = (x - 1) (5x - 4)(x + 3). 

4 

This means that the roots of fix) = 0 are x = 1 , x = 5 and x = -3. 
There are two roots in the interval from x = 0 to x = 2. I show a sketch of y = fix) in 
Figure 8.E.10. 
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Figure 8.E.10 



We would only see the sign change for the roots x = 5 and x = 1 by taking a value between 
them. Check for yourself that choosing such a value does make f(x) come out negative. 

Statement (2) is wrong for quite a different reason. We drew a picture of this function in 
Section 3.B.(i). The sign change here doesn't mean that /(x) has crossed the x-axis between 
x = 1 and x = 3. The curve has a jump or discontinuity when x = 2 and gets to the other side 
of the x-axis this way. 

The two examples above give us two useful rules to remember when looking for 
roots. 



Rules for using a sign change when looking for roots 

(1) If f(xi) and f(x 2 ) have different signs, there must be at least one root between 
x x and x 2 provided that fix) is continuous from x x to x 2 . 

(2) If f(x) is continuous, then a sign-change tells us that there is an odd number of 
roots in the interval. 



You can think of 'continuous' here as meaning that f(x) can be drawn with a continuous 
straight line. The subtle mathematical non-pictorial meaning of this word is described in 
courses on mathematical analysis. 

Obviously too, in order to be able to use the Newton-Raphson method we must be able 
to differentiate f(x). It mustn't have any of the problems which were described in Section 
8.A.(f), in the part where we are working. 

example (2) Next, we'll use the Newton-Raphson method to find all the roots of 
(a) tanh x = 2x and (b) tanh x = jx. 

How many will there be? Will it be the same number for both (a) 
and (b)? 

Try sketching what you think will happen, using Section 8.D.(g) if 
you need to. 
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We found in Section 8.D.(g) that y = x is the tangent to y = tanhx at the 
origin because dldx (tanh x) = sech 2 x and sech 2 (0) = 1 . 

We can see from this, and from the shape ofy = tanhx, that_y = 2x will 
cut j = tanh x only once, at the origin, so x = 0 is the only solution of (a). 

I show a picture of this in Figure 8.E. 1 1 . 




tanh at 



Figure 8.E.11 



y = jx will cut y = tanh x three times, once at the origin and also at 
two other points symmetrically placed either side of the origin because 
y = tanh x is odd. (Turn it upside down and it looks the same.) So we just 
have one solution to find. 

We want the value of x on the right-hand side of the graph for which 



tanh x = \ x 



so 



tanh x -\x = 0. 



We let fix) = tanh x -jx and look for the solution here of fix) = 0. 

From the sketch we can see that if x < a then tanh x > jx. It looks as 
though a may be quite close to 2. 

f(2) = - 0.036 to 3 d.p. so the root is to the left of this. 

/(1.9) = 0.006 to 3 d.p. The change in sign confirms that the root 
lies between 1.9 and 2, since fix) is continuous. 

Since /(1.9) is closer to zero, we'll start with x x = 1.9. 

We have f(x) = tanh x -\x and fix) = sech 2 x -\. This gives us 

0.006237 \ 
-0.414390/ 



x 2 = 1.9 



1.915 to 3 d.p. 



/ 3.354578 X 10~ 6 \ 

x 3 = 1.915 - = 1.915 to 3 d.p. 

\ -0.416813 / 

The three solutions of tanh x = \x are x = -1.915, x = 0 and x = 1.915, 
correct to 3 d.p. 



example (3) Show, by drawing a sketch, that sin x = 3 - 2x has just one solution. 
Find this solution correct to 3 d.p. 

See how far you can get with this one yourself before you look at 
my solution. 
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^^^^ You must work in radians here, so set your calculator in radian mode. 



We want to solve sin x = 3 - 2x which is the same as 
sin x - 3 + 2x = 0. 

We let f{x) = sin x - 3 + 2x so /' (x) = cos x + 2. 

We can see from the sketch of Figure 8.E.12 that the root is less than 

3 

2- It also looks as though it could be greater than 1 . 




f(l) = -0.159 and /(1.5) = 0.997 so there is a root between x = 1 and 
x = 1.5 since f(x) is continuous. 

Since /(l) is closer to zero, we'll start with x x = 1. This gives us 

/ -0.159 \ 

x 2 =*i - = 1-063. 

\ 2.540 / 

/ -1.81829 X 10~ 4 \ 

x 3 = 1.063 - = 1.063 to 3 d.p. 

\ 2.48625 / 

so the solution is x = 1.063 radians correct to 3 d.p. 

exercise 8.E.4 For each of the following, draw a sketch to help you decide where the roots of the 

following equations might lie, and then use the Newton-Raphson process to find 
these roots correct to 3 d.p. 

(1) 2x 3 - 3X 2 + 6x + 1 = 0 (2) e x = 3 - x, (3) (0) sinh x = \ x (b) sinh x = 2x 
8.F Implicit differentiation 

8.F.(a) How implicit differentiation works, using circles as examples 

How could we find the rate of change of y with respect to x if we have a relation between 
them which does not give y described in terms of x? Let's look at two examples. 

example (1) Suppose we are given the equation x 2 + y 2 = 25. 

This is the equation of the circle whose centre is at the origin and 
whose radius is five units. (See Section 4.C.(d) if necessary.) 
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The relationship here between x and_y is called implicit, because we 
don't have it in the form ofy given as some expression inx. 

We can easily draw a sketch of this circle, and we can see how steep the 
curve looks at any point on it by sketching the tangent at that point. 
(Indeed, we can actually find this slope, using the property of the tangent 
being perpendicular to the radius, as we did in Section 4.C.(f).) 

But how can we find dyldx for this circle? Developing a technique to do 
this will make it possible for us to find dyldx for other curves where we 
have no alternative method of finding the gradient. 

One possibility would be to start by rearranging its equation so that we 
have_y 2 = 25 -x 2 . 

What is yl Can you see a possible complication here? 



We have y = ± ^25 -x 2 . This is not a function because there are two 
possible values of y for each possible value of x. These possible values of x 
lie between -5 and +5 inclusive. 

We can see exactly what is happening in Figure 8.F. 1 . 




Figure 8.F.1 



The equation j = + ^25 -x 2 gives the top half of the circle. 

The equation j = - ^25-x 2 gives the bottom half of the circle, and each 
of these are functions. 

Differentiating these square roots would not be very pleasant. We 
therefore argue that it would seem reasonable to go through the equation 
x 2 +y 2 = 25 differentiating it term by term with respect to x in just the 
same way that we differentiated the equation y = x 2 - 4x + 3 term by term 
to give dyldx = 2x - 4 in Section 8.E.(a). 

(The equation j = x 2 - 4x + 3 gives y explicitly in terms of x.) 

The problem that we have with this new equation is that we shall need to 
differentiate y 2 with respect to x. 

We can do this by using the Chain Rule. We know thatj 2 differentiated 
with respect to y is 2y, and we then multiply this answer by dyldx. 

We are saying that 

dx \dy I \dx) dx 
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Now, differentiating x 2 +y 2 = 25 term by term with respect to x gives us 

dy dy dy x 

2x + 2y — = 0 so 2y — = -2x and — = - — . 
dx dx dx y 

How does this result fit in with the particular examples shown on Figure 
8.F.1? 

4 

At the point (4,3), dyldx = -3. 

This agrees with what we know the gradient of the tangent here must be, 
because the gradient of the radius to this point is 4. The tangent is 

4 

perpendicular to the radius so its gradient is -3. (This uses m { m 2 = - 1 from 
Section 2.B.(h).) 

In fact, at any point on this circle with coordinates (x,y), the gradient of 
the radius isy/x and the gradient of the tangent is-x/y. We can see here that 
geometry and calculus both give us the same result. 

This extends to special cases like the gradient at the point (0, -5) which is 
zero, and the gradient at the point (-5, 0) where dyldx is undefined. Clearly 
from the diagram this must be so, because the tangent here is vertical. 

example (2) Suppose this time that we want to find dyldx for the implicit equation 

x 2 -6x+y 2 -4y=12. 

What kind of curve does this give? 



This equation can be written in the form (x - 3) 2 + (y - 2) 2 = 25 . 

It describes the circle whose centre is at (3,2) and whose radius is 5 units. 

We drew this particular circle in Section 4.C.(f) and found the gradients 
and equations of some of its tangents. This will help us now to see what is 
happening geometrically, and so to be able to make sense of some of the 
answers which we get by calculus. 

To find dyldx for this circle we again differentiate its equation term by 
term, remembering that y differentiated with repect to x is simply dyldx. We 
get 

dy dy 
2x - 6 + 2y — - 4 — = 0. 
dx dx 

Remember that differentiating a number gives zero, so dldx ( 1 2) = 0. It doesn 't 
change so its rate of change is zero. 

Tidying up gives 

dy dy 6 - 2x 3 - x 

— (2y - 4) = 6 - 2x so — = = 

dx dx 2 y - 4 y - 2 

dividing top and bottom of this fraction by 2. Always simplify when you can. 

We can now see how this ties in with the gradients of the four tangents 
which we already found for this particular circle in Section 4. C.(f). 

The points of contact of these tangents are (7,5), (-1 , -1), (3,7) and (8,2). 

Try using dyldx yourself here to find the gradients of these four tangents. 
Use Figure 4. C. 1 1 in Section 4.C.(f) to sort out what is happening if some of 
your results seem rather curious. 



8.F Implicit differentiation 



355 



Substituting in these pairs of values forx and y in turn, we get 

dy 4 dy 4 

— at (7,5) is -3 and — at (-1, -1) is also -3. 

dx dx 

(We can see that this is right on Figure 4.C. 1 1 . The gradients of the two 
radii to the points of contact are both 4.) 

dyldx at (3,7) is zero and the tangent there is horizontal. 

^^^^ When you find that the gradient of the tangent at the point (8,2 ) is dyldx = -5/0, 
^B-^V don't be tempted to cross your fingers and say that this is zero as many students 
do! dyldx at (8,2) is undefined because the tangent is vertical. 



Using Figure 4. C. 1 1 we have seen geometrically that the answers which 
we have found by differentiating do make sense. 

Also, from this same diagram, we can see that the gradient of the radius to 
the point (x, y) on this circle is (y- 2)/(x- 3). 

Therefore, using m l m 2 = -1 , the gradient of the tangent at this point is 

x - 3 3 - x 
y - 2 y-2 
which is exactly what we get by differentiating. 

exercise 8.F.1 Find dyldx for the circle whose equation is 

x 2 + i6x + y 2 - 4/ - 101 = 0. 

Use this result to find the gradient of the tangents to this circle at the four 
points with coordinates (4, -3), (-3, 14), (-8, -11) and (-21, 2). 

Draw a sketch of this circle showing these four tangents. 

Check your results with the answers to Exercise 4.C.3 which find these same 
gradients without differentiating. 



8.F.(b) Using implicit differentiation with more complicated relationships 

What will we do if we have a curve whose equation has a term with x and y multiplied 
together? Let's look at an example. 

example (1) Suppose we have the equation 2x 2 + xy - y 2 = 5. 

In order to differentiate xy with respect to x we use the Product Rule 
because we have the two variables x and y multiplied together. (See 
Section 8.C.(d) if necessary.) 

This gives us 

d dy dy 

— foO = XI) +x — =y + x—. 
dx dx dx 

So differentiating 2x 2 + xy - y 2 = 5 with respect to x gives 

/ dy \ dy 
Ax + [y + x — - 2y — = 0. 
\ dx dx 
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So 

dy dy —Ax — y 4x + y 

— (x - 2y) = -Ax - y or — = = , 

dx dx x - 2y 2y - x 

multiplying top and bottom of the fraction by -1 to make it easier to handle. 

At the point (2,3) on this curve, (check that it is!), we have dy/dx = ^ 

giving the slope of the tangent here, and therefore the gradient of the 

curve at this point. 

We can find the equation of this tangent using y - y x = m(x - x x ) 

from Section 2.B.(f). It is y - 3 = ^ (x - 2) or Ay = llx - 10. 

As students sometimes find this process slightly tricky, I shall give you another example 
which needs the use of the Product Rule. 

example (2) Find dy/dx for the equation 2x 3 - 3x 2 y + 5xy 2 + 2y 3 = 6 and hence find 
the gradient of the tangent at the point (1,1). 

I think it is easier if you take the numbers outside as factors for the 
terms involving the Product Rule, particularly if they are negative, as it 
is easy to lose one of the minus signs otherwise. 

I shall start by showing the equation split up in a working sort of 
way as follows: 

2x 3 - 3([x 2 ] [y]) + 5 ([x] [y 2 ]) + 2y 3 = 6. 

Now, differentiating all through with respect to x we get 

6x 2 - 3([y] [2x] + [x 2 ] [dy/dx]) + 5(\y 2 ] [1] + [x] [2y dy/dx]) + 6y 2 — = 0. 

dx 

Tidying up, we get 

^ dy „ dy „ dy 

6x 2 - 6xy - 3x 2 — + 5y 2 + lOxy — + 6y 2 — = 0 
dx dx dx 



so 



dy 

— (6y 2 + \0xy - 3x 2 ) = 6xy - 5y 2 - 6x 2 
dx 

dy 6xy - 5y 2 - 6x 2 



dx 6y 2 + lOxj - 3x 2 

The slope of the tangent at the point (1,1), i.e. the gradient of the curve 
at that point, is given by 

dy 6-5-6 5 

dx 6+10-3 13" 

Try this similar example for yourself now. 

example (3) Find dy/dx for the curve given by the equation 3x 3 + 7x 2 y - 3xy 2 + 2y 3 = 2 1 . 

Hence find the gradient of the tangent at the point (1, 2) on this curve. 



^^^^ Make sure that you haven't started off your answer with 'dy/dx =' because this 
^H-^V is not at all what you mean. 
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What you are doing here is differentiating the whole expression with 
respect to x, so your answer should only start with 'dyldx =' if the 
original equation starts in the form 'y ='. 

Setting out the given equation in a working sort of way as I did in the 
last example gives 

3x 3 + 7 ([x 2 ]\y]) - 3 ([x][y 2 ]) + 2y 3 = 21. 

(You don't have to do this step, but if you are at all unsure about 
keeping track of where you are then I think it will help you.) 

Differentiating this expression term by term with respect to x gives 



9x 2 + 7 [y][2x] + [x 2 ] 



dy 
dx 



3 [y z ][l] + [x] 



dy 
dx 



, dy 
6y 2 — = 0. 
dx 



Again I have used square brackets so I can show you how the Product 
Rule is working on each bit. Tidying this up gives 



9x 2 + I4xy 
dy 



lx' 



dy 



3y 2 - 6xy 



dy 



so 



dx 
therefore 



(7x 2 



dy 
dx 



dx dx 
6y 2 - 6xy) = 3y 2 - 9x 2 - Uxy 

3y 2 - 9x 2 - I4xy 
6y 2 + lx 2 - 6xy 



dy 

6y 2 — = 0 
dx 



At the point (1,2) on the curve, the slope of the tangent is dyldx = 
-25/19. 



8.F.(c) Differentiating inverse functions implicitly 

In Section 8.D.(i), we differentiated inverse trig and hyperbolic functions using dyldx = 
\l{dxldy). This seems reasonable if we take these as the limiting cases of dyldx and ll(dxldy) 
respectively. For this rule to work, we have to be sure that dyldx is not equal to zero for any 
particular value of x which we might want to consider. 

It is also possible to differentiate these inverse functions implicitly. 

I'll show you how you can do this by taking the example of y =f(x) = skr 1 (x/a). 

We know from Section 5.A.(h) that y must lie between - tt/2 and + nil inclusive. The 
function itself looks like the sketch in Figure 8.F.2. 




Figure 8.F.2 
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Notice that for this function we must have -a < x < a. 

We have j =/(x) = sin -1 xla, so therefore xla = sin y because this is an inverse function, 
and there is only the one possible value for y from a given value of xla. 
Differentiating xla = sin y implicitly with respect to x gives 

dy dy 1 

Xla = cos y — so 



But 



dx dx a cos y 

sin 2 j + cos 2 y = 1 so cos 2 y = I - sin 2 y = 1 



2 2 2 

x a -x 



2 2 

a a 



We can see from Figure 8.F.2 that the gradient of y = sin 1 (xla) is positive, and therefore 
we can say 

dy 1 1 \ a 2 ^ 1 



dx a cos y a \ a 2 - x 2 ^a 2 - x 2 

(The means that we are taking the positive square root here.) 
We know that for this function to work we must have -a < x < a. 
Now, we must also exclude x = a and x = -a. Why do we have to do this? 



The answer is: because when x = a or x = -a the tangent toy = sin (xla) is vertical. The 



fraction 1/y a - x is undefined. We've now seen what happens when dyldx = 1/0, and why 
we must exclude this possibility. 
This gives us the following result. 




In Section 8.D.(f) we showed that cosh -1 x = In (x + -jx 2 - 1), and that, to have this inverse 
function for y = cosh x, we needed to restrict the possible values of x by saying that x > 0, 
so that we were taking the cosh of a positive quantity. 

We'll now differentiate f(x) = cosh -1 (xla) implicitly. In Figure 8.F.3 I show a sketch of 
y = a cosh x and_y = cosh -1 (x/a). It is very similar to Figure 8.D.6 except that 1 is replaced 
by a. 




Figure 8.F.3 
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We can see from this sketch that, ify = cosh 1 (x/a), then we must have x > a. 
Try finding dldx (cosh -1 (x/a)) for yourself by using implicit differentiation. 



The method is very similar to what we just did for y = sin -1 (x/a). 
We have y =f(x) = cosh -1 (x/a) so x/a = cosh y. 
Differentiating implicitly with respect to x gives 

1 dy dy 1 

— = sinh y — so — = ; . 

a dx dx a sinh y 

Using cosh 2 y - sinh 2 y = 1 gives us 



We can also see that the gradient of y = cosh 1 (x/a) is positive, so we can say that 



Also, like last time, we can't have x = a because this is where the tangent to y = cosh -1 (x/a) 
is vertical, and we would have the undefined fraction of q. 
We now have the result that 



We can use the Chain Rule with these results to differentiate more fancy functions in 
exactly the same way that we used it in Section 8.D.(i) for fancy inverse functions of sinh 
and tanh. 

I'll show you how to find dldx (sin -1 (x 2 - 1)) as an example. 
We have just shown that if y = sin -1 X then 






for x > a. 



dy 1 



dx V 1 -x 2 ' 

In this particular example, X = x 2 - 1 so dX/dx = 2x. 



Using the Chain Rule of dy/dx = (dy/dX) (dX/dx) we have 



For this to work, we would need to have 0<x 2 <2so-v2<x< yfl. 




p-x 2 ' 



2 



In general, we can say that 



d Id 
— (sin - (lump)) = - - X — (lump) 

dx v 1 - (lump) dx 



with the requirement that -1 < (lump) < 1. 
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Similarly, we have 



d Id 
— (cosh- 1 (lump)) = = X — (lump) 

dx v (lump) - 1 dx 

with the requirement that (lump) > 1 . 



You may be given the formula below as a rule for differentiating inverse functions. 



dy 1 
If y = f- x (x) and f~ x (x) = F{x) then — = F' (x) = 



dx f (F(x)) 



This has exactly the same meaning as what we have been doing above. The only difference 
is that it is written in function notation. 

I think that you might need reminding just how this works. 

• /' (x) means f(x) differentiated with respect to x. 

• /' (X) means /' (X) differentiated with respect to X. 

• /' (f{x)) means f(f(x)) differentiated with respect to f(x). 

In general, we can say that 



/' (lump) means /(lump) differentiated with respect to (lump). 



I can now show you where this formula comes from. 

Suppose we have an inverse function y = f~ l (x). 

Then x =f(y) because this is what an inverse function means. 

Differentiating implicitly with respect to x, using the Chain Rule, gives 

d dy dy dy 1 

i = — (fty)) — =/00— so 



dy dx dx dx f (y) 

But y =f~ l (x) and f~ l (x) is also a function of x. 
Suppose we call it F(x). Then we can say 

dy d 1 
(F(x)) = F'(x) 



dx dx f (F(x)) 



8.F.(d) Differentiating exponential functions like x = 2 f 

This particular function is the one which we used in Section 3.C.(a) to describe an example 
of cell growth. We said in Section 3.C.(e) that the rate of increase at any time t is equal to 
some constant, k, multiplied by the number of cells present at that time, but we couldn't then 
find the value of k. 

It's now easy for us to do this. We have x = 2' so, taking natural logs both sides of this 
equation, In x = In (2 r ) = t In 2 using the third rule of logs of Section 3.C.(d). 
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Differentiating In x = t In 2 implicitly with respect to t, we get 

1 dx dx 

= In 2 so — = x In 2. 

x dt dt 

We now know that the value of k is In 2. (I said it would be this in Section 3.C.(e).) 
We can also write this answer in terms of t if we want to. We have 

dx 

— = x In 2 = (In 2) 2' since x = 2'. 

The In 2 is the scaling factor which gives us the difference between the rate of increase 
of x = 2* and x = e*. If x = e', the scaling factor, k, is equal to 1. 

The rate of increase at any given time is the same as the quantity of the substance present 
at that time. 

Here is a second rather nastier-looking example. (Although it looks nasty, it is actually 
quite simple to do.) 

If y = x x +x , what is dyldxl 

Again we take natural logs both sides of the equation. This gives us 

In y = In (x* 2 + [ ) = (x 2 + 1) In x. 
Differentiating implicitly with respect to x, using the Product Rule, gives us 

I dy /1\ 1 
= 2x In x + (x 2 + 1) - = 2x In x + x + -. 

y dx \ x I x 



Therefore 



dy I 1 \ / 1 . . , 

= y 2x In x + x + — J = 2x In x + x + — (x x ). 



dx \ x I \ x 



8.F.(e) A practical application of implicit differentiation 

I shall finish this section on implicit differentiation by giving you an example of a practical 
use for it. 

The volume of metal in a hollow sphere remains constant. If the inner radius is increasing 
at the rate of 3 cm s _1 find the rate of increase of the outer radius when the two radii are 2 cm 
and 4 cm respectively. 

4 -t 

The volume of a sphere of radius r is j nr . I show a drawing of a cross-section through 
the centre of the hollow sphere in Figure 8.F.4. 




Figure 8.F.4 
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The volume of metal, V, is given by 
V=\ JiR 3 -\ Jtr 3 . 

V, R and r are functions of time, t, so differentiating implicitly with respect to t gives 

dV dR dr 

— = AjiR 2 — - 4jtr 2 — . 
dt dt dt 

The volume V remains constant so dVldt = 0. 
Therefore we have 

dR dr 
Ajx R 2 — = Anr 2 — . 
dt dt 

3 

At the instant we are interested in, R = 4, r = 2 and drldt = 3. Therefore, dRIdt = 5. 
At this instant, the outer radius is increasing at a rate of 5 cm s _1 . 



exercise 8.F.2 In the first three questions, differentiate the given equation implicitly, rearranging 

in each case to find an expression for dy/dx. Then use this answer to find 

(a) the gradient of each curve at the point whose coordinates are given in the 
question, and 

(b) the equation of the tangent to the curve at this point. 

(1) x 2 y 2 = 25. The point is (1,5). 

(2) x 2 + 3xy - y 2 = 2. The point is (1,2). 

(3) l/y + 2/x = 1 

Do this one by differentiating the equation given above. Then do it a second 
time by differentiating the equation you get if you multiply all through by xy 
to get rid of the fractions. 

In each case, use your expression for dy/dx to find the gradient of the curve 
at the point (4,2), and the equation of the tangent there. 

(4) Use the Chain Rule to differentiate the following two functions with respect tox 
(a) sin -1 (2x - 5) (b) cosh -1 (3* + 1) 

In both cases, say what restrictions you must put on x for the answer to work. 

d 1 

(5) Show that — (tanlr 1 x) = . 

dx 1- x 2 

(6) Differentiate the following three functions with respect to x 

(a) y = fix) = x* (b) y = fix) =3 x^* (c) y = f(x) = fex)* + * 

8.G Writing functions in an alternative form using series 

We know already that we can write 

1 + x + x 2 + x 3 + x 4 + . . . as provided that |x| < 1, 

1 - x 

using the rule for the sum to infinity of a GP, from Section 6.C.(c). 

We also found in Section 7.C.(a) that if we did a binomial expansion on (1 -xy 1 , keeping 
our fingers crossed that it would work with n = -1, we did in fact get the same series. 
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Might it be possible to find a way of writing other functions in the alternative form of 
series? 

The answer to this question is a qualified 'yes'. When calculus is approached entirely 
through proofs and arguments based on taking mathematical limits, one of the most 
powerful results to come out of this is Taylor's Theorem. This gives a proof that such 
series expansions do indeed exist if certain conditions are met. (We saw some reasons 
why such a formal approach is necessary to lay mathematically firm foundations for 
calculus in Section 8.A.(f).) This rigorous approach is beyond my scope here because to 
do it properly takes a great deal of space and a dedicated book. My purpose is to give 
you a working knowledge that you can use in other areas together with an intuitive feel 
and understanding for what is happening, which will then make a pathway to lead you 
into whatever depth you later need to go. It is, however, possible for me to show you what 
some of the results of this approach will be, so that you can see why they are 
important. 

Taylor's Theorem gives a way of approximating to functions by considering their rates 
of change, and the rates of their rates of change, and so on. In other words, if we have a 
function x = f(t), then the theorem makes it possible (with certain qualifications 
concerning how this function behaves) to write the function in terms of dxldt, d 2 x/dt 2 and 
so on, or, in function notation, in terms of f'(f), f'(t) and so on. There is a very good 
description of how this works in Louis Lyons' book All you wanted to know about 
mathematics but were afraid to ask (Cambridge University Press 1995). 

Because these series are so important we will look at some straightforward cases together 
now. We will only consider functions like e* or sin t or cos / where we know that we can 
continue differentiating for ever if we want to. Also, we will only look at functions for which 
we know that there is no problem differentiating when t = 0. 

Let us suppose that it is possible to write such a function as a series expansion so that we 
have 

f(t) = a 0 + a x t + a 2 t 2 + a 3 t 3 + a 4 t 4 + . . . which I will call (1) 

a 0 ,ai, etc. are coefficients which will depend upon the particular function f(t) which we are 
considering. 

If we put t = 0 in (1) above, we get /(0) = a 0 because everything else disappears. 

Also, if the series of (1) is truly representing the function f(t), we would expect that 
differentiating it term by term should give the same result as differentiating the function 
itself. (This isn't obvious - we have seen in Section 6.F that infinite series can behave in very 
odd ways indeed. It can in fact be shown that it does work for all the examples which we 
shall look at here.) 

If we differentiate (1) with respect to t, we get 



Putting t = 0 now gives us f'(0) = a x because all the further terms disappear. This system is 
beginning to look promising. Similarly 



f'(t) = a x + 2a 2 t + 3a 3 t 2 + 4a 4 t 3 + 5a s t 4 + . . . 



(2) 



f"(t) = 2(1)0 2 + 3(2)a 3 t + 4(3)a 4 t 2 + 5(4)a 5 t 3 . . . 



(3) 



so f"(0) = 2(\)a 2 . And 



f" {t) = 3(2)(l)a 3 + 4(3)(2)fl 4 f + 5(4)(3)a 5 f 2 + . . . 



(4) 



so/"(0) = 3(2)(1)0 3 - 
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Writing all these dashes for the successive differentiations is beginning to be a little 
clumsy, so I will replace them from now on with a number which stands for the number of 
times that f(t) has been differentiated. 

Do the next differentiation yourself, so finding f 4 (0). 



You should have/ 4 (0) = 4(3)(2)(l)a 4 . 

We can see that these are building up using factorials and we can write the results so 
far as 

f(0) f(0) f 3 (0) f 4 (0) 

a 0 =f(0) a x =— — «2 = ^p % = ^T ° 4 = ^T 

Since we are only considering functions here which can be differentiated as often as we like, 
we could say that a r =f (0)/r! where a r is the rth coefficient. (We started the count with 
r = 0). 

This gives us the following result. 



Provided that certain conditions are met, it is possible to say 

fit) =/(0) + 1 + t 2 + t 3 + 1 4 + ... 

1! 2! 3! 4! 



The little superscript numbers in the above expression refer to how many 
times f(t) has been differentiated; they are not powers. 



Series like this, which are a special case of Taylor series, are called Maclaurin series after 
the Scots mathematician Colin Maclaurin. 

We can now use these results to write down some particular examples. 

EXAMPLE (l) f(t) = e*. 

We have already found in Section 8.B.(a) that we can write e in the 
form of the infinite series 

1 1 1 

1+ — + — + — + ... 
1! 2! 3! 

Will we get a similar series for f(t) = e' if we use the above process on 
it? 

We know that f(t) = e' remains unchanged when it is differentiated 
with respect to t, so we have fit) = fit) = fit) =f (t) = e' for all 
values of r. Also e° = 1, so we get 

t t 2 t 3 f 
e' = 1 + — + — + — + ... + — + ... 
1! 2! 3! r! 

which agrees with our previous series if we put t = 1 so that we start 
with e. 
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This series can also be written in the 21 form which we described in 
Section 6.D. 
We would have 

oo .r 

e'=l-. 
r=0 r\ 

Notice that we are starting the count with r = 0. 

EXAMPLE (2) f(t) = Sill t. 

If/IO = sin t then/'(0 = cos t,f(t) = - sin t,f" (?) = - cos t and f"" (t) = 
sin t. 

At this point, we have come back to the beginning of the cycle again. 
Now, sin(O) = 0 and cos (0) = 1 so we have 

1 1 

ao = 0 ai = — a 2 = 0 a^=- — a 4 = 0 and so on. 
1! 3! 



So 



t t 3 t 5 f t 2r+1 
sin? = — - — + — - — + .. . + (-l) r 



1! 3! 5! 7! (2r + 1)! 

starting the count with r = 0. 

Notice the flip of the signs given by the (-1)'". If r is even, the term is 

positive, but if r is odd then the term is negative. 

This series can be written in the 2 form as 

0= t 2r+l 

sint=2 (-If . 

r=0 (2r + 1)! 

We have already found experimentally and geometrically in Section 
4.D.(e) that the first term of this series gives us a very good approximation 
for sin tif t is small. 



For the same reason as then, it is essential that t is measured in radians for 
any trig series. 



How far do you need to go with summing this series for sin t in the case of 
t = jt/6 to get a good approximation to the exact answer of 5? (Let's say 
'good' means to 4 d.p.) It works amazingly quickly. Try t = Till too. 

For this process to work at all, we are assuming that adding the terms of 
the various series that we get will bring us closer and closer to some definite 
sum the further we go, that is, we are assuming that these series are 
convergent. (I describe the meaning of 'convergent' in Sections 6.C.(c) and 
(d), through the various possibilities of what can happen when we sum GPs.) 
If any of these series isn't convergent, we would find as we did there with 
the geometric series 2, 6, 18, . . . and its non-existent sum to infinity, that we 
were writing nonsense. 
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EXAMPLE (3) ft) = COSh t. 

This time, the answers repeat in pairs when we differentiate. 
We have fit) = sinh t and/'(0 = cos h t and so on. Also, 

e° + e~° 2 e° - e~° 1 - 1 

cosh (0) = = - = 1 and sinh (0) = = = 0. 

2 2 2 2 

This gives us 

t 2 t 4 t 2r 

cosh t = I + — + — + ... + + . . . 

2! 4! (2r)! 



Writing this in 2 form gives us 

00 

cosh t = X 



00 t 2r 



0 (2r)! 

again starting the sum with r = 0. 

Now, suppose instead that we had wanted the series for ft) = 
cosh (2t). 

We could do this in two ways, of which the easiest is simply to 
replace the '?' in the series above by 2t. This then gives us 

At 2 I6t 4 (2t) 2r 

cosh 2t = 1 + + + . . . + + . . . 

2! 4! (2r)\ 

Alternatively, we could have successively differentiated f{t) = cosh(20 
with respect to t to find the coefficients. 

Each time that f(t) = cosh (2?) is differentiated with respect to t, it 
gets multiplied by 2 because of the Chain Rule, so the answer comes 
out exactly the same. 

EXAMPLE (4) f(t) = ln(l + t) 

(Why couldn't we find a series for In r?) 

(Because In 0 is undefined and so we immediately run into an 
impossible situation.) 

1 

If f{t) = ln(l + 0 then /(*) = . 

1 + t 

To find f'(t), it is easiest to write this as ( 1 + t)~ l . Then differentiating 
again, we get 

1 

fit) = -(1 + ty 2 = -. 

(1 + t) 2 

Find f(t), f\t) mdfit) for yourself. 



You should have the following answers 
fit) = ( 1)< 2)(1 + ,) : — 



(1 + o 3 

3! 

/ 4 (f) = (2)(-3)(l +t)- 4 = --^-^ 
fit) = (2)(-3)(-4)(l + 0" 5 = ^^j. 
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This gives us/(0) = In 1 = O,/ 1 (0) = l,f 2 (0) = - l,/ 3 (0) = 21, f 4 (0) = -3!, 
and/ 5 (0) = 4!. 

If this pattern continues, what will we have for / r (0)? 



We would get (-iy _1 (r - 1)!. 

Putting all this information together gives us the series 

t t 2 2' 3' 4' 
ln(l +0 = 0 + -- — + — t 3 - — t 4 + — t 5 - ... 

1 2! 3! 4! 5! 

(r- 1)! 

+ (-ly -1 f + ... 

r\ 

Cancelling as much as possible, we get the series 

t 2 t 3 t 4 t 5 c-iy -1 

ln(l +t) = t- — + — - — + — - ... + f + ... 

2 3 4 5 r 

We can write this in the 2 form as 

CO /_■! \r — 1 

ln(l + o = I r. 

r = 1 r 

Notice that this time we are starting the count with r = 1 . 

Try putting f = 1 , and see if you can find a good approximation to 
In 2 from summing the first few terms of the series above. 

You have In 2 = 1 -\ + \ -\ + \ -\ + . . . 

This is the series which we met briefly at the end of Section 6.F when we 
said that it is convergent, unlike the frog down the well, or harmonic series 
of 1+5 + 1 + 1+ .. • which isn't. 

As you feed in each successive term, you will see the sums flipping 
from one side to the other of the actual value of In 2. Unfortunately, 
although they are getting closer to In 2, this is happening extremely 
slowly. 

A much faster way of finding In 2 by means of a series comes from 
using 

At) = In ( = ln(l + 0 - ln(l - t). 

We know that ln(l + t) = t - \t 2 + \t 3 - \ t 4 + \t 5 - . . . 

Putting Y = -t gives ln(l - t) = -t - \t 2 - \t 3 - \t 4 - \t 5 - . . . 

So ln(l + t) - ln(l - t) = 2(t + \t 3 + \t 5 + . . . ) 

assuming that we can do this tricky move. 

What value of t would you have to use to find In 2 from 

1+0' 

In I I? 

1 -t 
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1 + ? ! 

Putting = 2 gives 1 + t = 2 - It so 3? = 1 and t = ?. 

1 -? 

Try substituting ? = \ in the series above and see how you now get 
on with finding a value for In 2. You will see that this series converges 
much more rapidly. 

There is an important question that we need to ask here. 
Do all of these series work for any value of ?? 

For example, we already know from Section 6.C.(c) that the GP of 1 + x + x 2 + x 3 + . . . 
is only convergent if |x| <1. 

It seems quite possible that the series which we have been looking at here only converge 
to their functions for a restricted set of values for ?. 

In fact, it can be shown that the series for e\ sin ?, cos ?, sinh ?, and cosh ? are convergent 
for all values of ?. However, the series for ln(l + ?) is only convergent if -1 < t < 1. 

exercise 8.G.1 Now try the following questions yourself. 

(1) Use the series expansion for e l which we found in the first example to write 
down the series expansions of (a) e 2f and (b) e _t . 

(2) Use the series expansion which we found for sin f in the second example to 
write down the series expansions for (a) sin 2f and (b) sin(-f). 

(3) Find the series expansion for f{f) = cos f and then use this to write down 
expansions for (a) cos 2f (b) cos(- f) and (c) cos (nf) where n is standing for a 
positive whole number. 

(4) Find the series expansion for f(t) = sinh f. 

Now check that you've got the right answer by adding your series together 
and using cosh r + sinh t = \{e t + e _f ) + ^(e f - e"0 = e f from Section 8.D.(a). 

Now that we have a way of writing e\ cosh t, sinh t, cos t and sin t in the form of series, 
the curious similarities which we have been finding in the behaviour of the trig functions 
cos? and sin? and the hyperbolic functions cosh? and sinh? no longer seem quite so 
surprising. 

We know that e* = cosh ? + sinh ?. 

It looks as though, with a bit of cunning juggling, we ought to be able to link e\ cos ? 
and sin ?. Can you see any way of doing this? Maybe if we put '?'=-?? Or if we subtracted 
the series? 



Whatever we do, we cannot quite make it work - yet. We shall find out in Section 
lO.C.(b) that these series do all slot together most beautifully. 
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In this chapter, we discover what kinds of information we can find out if we do the 
reverse process to differentiating. This includes finding areas and also finding the 
equations of functions if we know their rates of change. 
The chapter is split up into the following sections. 

9.A Doing the opposite of differentiating 

(a) What could this tell us? (b) A physical interpretation of this process, 

(c) Finding the area under a curve, 

(d) What happens if the area we are finding is below the horizontal axis? 

(e) What happens if we change the order of the limits? (f) What is J(i/x)dx? 

9.B Techniques of integration 

(a) Making use of what we already know, (b) Integration by substitution, 

(c) A selection of trig integrals with some hyperbolic cousins, 

(d) Integrals which use inverse trig and hyperbolic functions, 

(e) Using partial fractions in integration, (f) Integration by parts, 

(g) Finding rules for doing integrals like /„ = /sin" x dx, 

(h) Using the f = tan (x/2) substitution 

9.C Solving some more differential equations 

(a) Solving equations where we can split up the variables, 

(b) Putting flesh on the bones - some practical uses for differential equations, 

(c) A forwards look at some other kinds of differential equation, including ones 
which describe SHM, 



9.A Doing the opposite of differentiating 

9.A.(a) What could this tell us? 

We know from the last chapter that, when we differentiate a function y =f(x), we find a rule 
which gives us the slope or gradient of f{x) at any point on its curve. So, for example, if we 
have y =f(x) = x 2 then dyldx =f'(x) = 2x, and the gradients of this curve when x = 0, 1 and 
3 are given by dyldx = 0, 2 and 6 respectively. We can use this rule of dyldx = 2x to find its 
gradient for any given value of x. 

Now suppose that we start from the other end, and we are told that we have a function 
for which dyldx =f(x) = 2x. Since we know the slope of f(x) for any given value of x, we 
must know what this curve looks like, so we must surely be able to sketch it. Try doing this 
for yourself. 



It is true that we know what this curve looks like but we can't sketch it because we don't 
know where it is. Any curve shaped in the same way as y = x 2 , but shifted up or down, will 
have a slope described by dyldx = 2x so there are infinitely many possibilities. 

Can you think of an extra piece of information which we could be given, so that we would 
know which particular curve we were talking about? 
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If we are told a particular point which lies on the curve then there is only one possible 
answer. As the curve ofy = x 2 slides up and down, it only goes through this point in one 
possible position. For example, if we know that this particular f(x) passes through the point 
(1,3), then its equation must be y = x 2 + 2. 

Figure 9.A.1 below shows four possible curves for which dyldx = 2x, including the curve 
y = x 2 + 2. The other three are y = x 2 + 5, y = x 2 and y = x 2 - 3. 




-3 



Figure 9.A.1 

All the curves for which dyldx =f'(x) = 2x can be described by the equation f(x) = x 2 + C 
where C is the number giving the shift up from the x-axis of that particular curve. When you 
differentiate f(x), the C disappears because dCldx = 0. 

The results of these forwards and backwards processes have special names. 



fix) is called the derivative of f(x). 

Any f(x) that we get by working backwards from f'(x) is called an antiderivative 

of Ax). 

The difference is that there is only one derivative but infinitely many 
antiderivatives. 

The process of finding the antiderivative is called antidifferentiation. 



exercise 9.A.1 If I tell you what f'{x) is for three functions, and also give you one point which each 

curve goes through, see if you can find the equation of the curve and sketch it. 

(1) f'(x) = e x and y = f(x) passes through the point (0,2). 

(2) g'(x) = sinh x and y = g(x) passes through the point (0,0). 

(3) h'(x) = 3X 2 and y = h(x) passes through the point (1,2). 

9.A.(b) A physical interpretation of this process 

Suppose that there is a race which takes place on a straight track. (The track is straight so 
that I can make everything happen in the same direction. This will mean that velocity will 
be the same as speed, since velocity is speed with its direction given also.) 
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At the beginning of this particular race, all the competitors start running with the same 
constant acceleration of 2 m s~ 2 (that is, 2 metres per second per second). 

Velocity is the rate of change of distance with respect to time so, for travel in a straight 
line, we can say that v = dxldt. (v stands for the velocity after a time t, and x is the distance 
travelled from the starting line.) 

Since acceleration is the rate of change of velocity with respect to time, we can also say 
that dvldt = 2. (We also talked about acceleration in Section 8.A.(e).) 

We take t = 0 when the starting pistol fires. All of the competitors then start running, so 
after a time t we have v = 2t since dvldt = 2 and v = 0 when t = 0. (This physical knowledge 
is the equivalent of our known point on the curve in the last section.) 

Now, what will be the distance travelled from the starting line after t seconds? 



Again we do the reverse process to differentiation, this time on the equation v = dxldt = It. 

This gives us x = t 2 (since t 2 differentiated with respect to t is 2t), so the distance that 
these runners have travelled from the starting line after t seconds is t 2 metres. 

Or is it? Knowing that dxldt = It, can you be absolutely sure that x = t 2 l 

Suppose that, although all these runners started off with identical accelerations, after 
some further amount of time some get tired and slow down, perhaps because they are very 
young. If this is known in advance, the runners might be given staggered starts. If a runner 
begins at the starting line, then indeed after t seconds he will have run t 2 metres. But if a 
runner was given a start of 3 metres, say, then after t seconds he will actually be t 2 + 3 metres 
from the starting line. 

If another runner is given a start of C metres, she will be t 2 + C metres from the starting line 
after a time t. The physical meaning of the C here is the start which the runner has been given. 
A runner who is so good that he is placed behind the starting line will have a negative C. 

Similar processes will be involved in any physical situation involving distance, velocity 
and time. 

Here is a second example. 

If a particle is moving in a straight line so that v = It + e 2t find (a) the acceleration, a, 
and (b) the distance x travelled after a time t, given that x = \ when t = 0. 
What are the particular values of a and x when t = 21 

(a) We have a = dvldt = 2 + 2e 2t . 

(b) Since v = dxldt = 2t + e 2> , we have x = t 2 + \ e 2t + C. (Check this by differentiating 
back!) 

We know that x = \ when t = 0 so we know \ = 0 + \ + C, giving C = 0. 
When t = 2, a = 2 + 2e 4 = 111.2 to 1 d.p. and x = 4 + \ e 4 = 31.3 to 1 d.p. 

exercise 9-A.2 Try the following questions yourself. Questions (i)-(a) have the same meanings 

forx, v and a as in the example above. 

For questions (5) and (6) it is helpful to know that, since a = dv/dt and 
v = dx/dt, we can also write a as d/dt {dx/dt) = d 2 x/dt 2 . 

(1) If x = st 3 + 3t 2 find (a) v = dx/dt and (b) a = dv/dt. 

(2) If v = 3t 2 + it and x = 5 when t = 0, find (a) x and (b) a = dv/dt. 

(3) If a = dv/dt = 2f + 1 and v = 5 when t = 0, and x = 0 when f = o, find (a)v and (b)x. 

(4) If v = 4e 2t and x = 1 when t = o, find (a) x and (b) a. 

(5) If v = - sin f and x = 1 when t = 0, find (a) x and (b) a. 

(6) If v = cos(r + jt/6) and x = \ when t = o, find (a) x and (b) a. 
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9.A.(c) Finding the area under a curve 

We know that differentiation has the graphical meaning of telling us the steepness or 
gradient of a curve at any given point on it, provided none of the problems described in 
Section 8.A.(f) is present. 

This means that we can draw a second graph using the gradients of the first graph. Here 
is a physical example of this. 

Suppose we have a particle moving outwards from the origin along the positive x-axis so 
that its distance x from O is given by x = t 3 - t 2 + 2t. From this equation, we can see that 
x = 0 when t = 0. This particle is at the origin when we start measuring the time. 

The velocity v of the particle is given by v = dxldt, so we have v = 3t 2 - 2t + 2. From 
this equation we see that v = 2 when t = 0. 

We can show both the change in distance over time and the change in velocity over time 
graphically, and I have done this in Figure 9.A.2(a) and (b). 




Figure 9.A.2 



These two graphs include the pairs of values 

on (a) 
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on (b) 



The actual path of the particle looks like Figure 9.A.3. 



Here, whtn H«.rc wh*.n 

t-o t=i 



Figure 9.A.3 
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Here whtn 



The gradients of the tangents to graph (a) in Figure 9.A.2 for each particular instant in 
time give us the height of the curve of graph (b) at that same instant in time. This is because 
these gradients tell us the rate of change of x with respect to t at these points, so they tell 
us the velocity of the particle at those times. We have got a direct geometrical link going 
from graph (a) to graph (b). 
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Now comes what is probably the most important question in this chapter. Is there a 
geometrical way of looking at graph (b) which relates it back to graph (a)? 

Is there a way of showing the distance travelled by using the graph of velocity against 
time? 

If the velocity was constant, say 3 ms 4 , then the distance travelled in 2 seconds would 
be 3 X 2 = 6 metres. But the curve of graph (b) shows a velocity which is continually 
changing, so we know that the distance travelled will depend upon the particular time 
interval that we choose. Suppose we consider the time interval from t = 1 to t = 2. How could 
we find the distance travelled in this time interval using this continually changing velocity? 
(We know what it should be from graph (a), but how could we find it using only graph 
(b)?) 

We could say that if we took a very tiny time interval indeed then the velocity would 
hardly have changed during that time interval. Therefore, multiplying the velocity at the start 
of that tiny time interval by the length of the interval would give us a very good estimate of 
the distance travelled during that tiny time interval. 

We will call the tiny time interval dt, just like we did in Section 8.A.(a) in the last chapter. 
Then if we have a velocity of v at time t, the distance travelled in the time interval from t 
\ot + dt is given fairly accurately by the shaded strip of graph (a) shown in Figure 9.A.4(a). 
This strip has an area of vdt. (You need to imagine that dt is much smaller than I have drawn 
it. I have only drawn it this wide so that I can make my diagram clearer.) 




Figure 9.A.4 



This strip is a rectangle so it does not follow the shape of the curve at its top but, if we 
make the tiny interval small enough, we shall be very close to the area under the curve for 
this small time interval. 

I'll take A to be the total area under the graph over the time interval from t = 1 to t = 2, 
and I'll let 5A be the exact area under the curve over the time interval dt. I show A and 5A in 
Figure 9.A.4(b). 

A is made up of the sum of all the little strips like the one I've shown, each of them over 
a time interval dt. Therefore, if we use the sign '-' which means 'is approximately equal to', 
we can say 

t=2 t=2 

A= 1 6A~ 1 vbt. 

t=\ t=\ 

(The particular v depends on the particular strip you are looking at.) 
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Now, the smaller we make the time interval dt, the closer this sum gets to the exact area 
of A. We call this sum the integral of v from t = 1 to t = 2 and write it as 



The sign / comes from an elongated S for 'sum'. 

The two values of t, which here show the time interval we are considering, are called the 
limits of the integral. 

This integral is now giving us the distance travelled from t = 1 to t = 2, but what does 
it actually mean? 

From Figure 9.A.4(a) and (b), we can say that 6A ~ vdt so dAldt ~ v. 
But, from the last chapter, if we let (5? — > 0, we will have dAldt = v. 
Therefore we have the amazing result that A, the area under the curve, must be what we 
get if we do the reverse process to differentiation on v. 



Integration and differentiation are the reverse processes of each other. 

This is called the Fundamental Theorem of the Calculus and is crucially 
important to all mathematicians, scientists and engineers. 



Surely now we have the link which takes us back from graph (b) to graph (a) in Figure 
9.A.2, since we know that doing the reverse process to differentiation on v gives us x. We 
know that antidifferentiation gives us lots of possible answers, so how do we use the values 
of t = 1 and t = 2 at the ends of our particular time interval to get both the particular A of 
graph (b) and the particular distance travelled from f=ltof = 2of graph (a)? 

Doing the reverse process to differentiation gives us 



We know that our particular x is actually t 3 - t 2 + 2t, but the process of antidifferentiation 
won't tell us this, so I have included a constant, C, here. 

If we put t = 2, we get 8 + C. 

If we put t = 1 , we get 2 + C. 

The distance travelled from t = 1 to t = 2 is given by the difference between these two 
answers, which is 8 - 2 = 6. This agrees with what we get by feeding in t = 1 and t = 2 
directly to x, and then subtracting to find the distance travelled in this time interval. 

Notice that the C has cancelled out in the working we did from the integral. We can see 
why this happens by thinking about why the C is there in the first place. Its physical meaning 
is that graph (b) in Figure 9.A.2 could equally fit a second particle which starts C units from 
the origin when t = 0, but which moves with exactly the same velocity as the first particle. 
(This particle is doing exactly the same thing as a runner given a lead of C from the starting 
line in Section 9.A.(a).) The velocity/time graphs for both particles are identical, but the 
distance/time graph for the second particle would be pushed up C units from the t-axis. Both 
particles travel identical distances in any given time interval. In particular, they would each 
travel 6 units in the interval from t = 1 to t = 2. 

We have now found the geometrical link which takes us back from graph (b) to graph (a). 
I show this happening in Figure 9.A.5. 





from t = 1 to t = 2. 
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Figure 9.A.5 



We have the extraordinary result that the area under a curve made up of the sum of tiny 
strips, each depending on the equation of the curve, can be found from a calculation just 
involving the values at the two end points. 

As you go further with learning about integration in later parts of your course, you will 
see this same kind of simplification happening again, so that an area surrounded by a closed 
curve can be given in terms of its boundary curve, and a volume can be given in terms of 
the surface enclosing it. This is your first step along this powerful path of simplification. 

Any other distance travelled by the particle of our example can be found just by changing 
the limits on the integral. 

For example, the distance travelled from t = 0 to t = 2 is given by 



Notice that I have also written in the limits on the square brackets above. Sometimes these 
are written either end of a single vertical line. The working would then look like this: 



The opposite process to differentiation which we have done here is called integrating. 



An integral which has given limits is called a definite integral. 



To find what this area is, first integrate f(t) to give F(t), say. Then use the limits 
to find F(a) and F{b). The area is given by F(b) - F{a). 

An integral with no limits is called an indefinite integral. Integrating this gives us 
/ f(t) dt = F(t) + C where F'(t) =f(t). Integrals can also be written using other 
letters, most usually x. 




(3t 2 - 2t + 2)dt = t 3 - t 2 + 2t\ =8-0 = 8. 



o 



f(t)dt tells us the area under the curve x = fit) between the two limits 
t = a and t = b. 
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You can think of F{t) as either giving you a rule to find any particular area if you put in 
given limits, or as describing the curve whose rate of change with respect to t is fit), so 
F'{t) = fit). This description needs to include the constant C because we don't know the 
amount of vertical shift of F(t) away from the horizontal axis. Just as with the curves in 
Section 9.A.(a), we can only find the value of C if we are given more information. C is called 
the constant of integration. It doesn't affect any area which we find because it cancels out 
when the limits are put in. 




If you are finding a solution to a physical problem by using indefinite 
integrals, it is extremely important to include all the constants of integration, 
so that you have allowed for all the possible solutions. Often, you will have 
extra information which will make it possible to find the values of these, but 
leaving them out will make all your subsequent working wrong. 



We have just looked at one physical example here, so that we can actually see what is 
happening, but it can be shown by the methods of mathematical analysis that it is always 
possible to find areas under curves in this way provided none of the complications of Section 
8.A.(f) is present, and that we do indeed get closer and closer to a particular^ as we make 
the strips narrower and narrower and then add them all together. 

exercise 9.A.3 Try finding the following integrals yourself. Because integration is the reverse 

process to differentiation, you can check any answers you are doubtful about by 
differentiating them back before putting in your limits. 

Draw sketches to show the areas you are finding for questions (3), (4) (5) and (6). 

(1) f 3 (2f 2 + 3 f + i)dt (2) f 4 (2t 3 -t)dt (3) {** smxdx 

Ji Jo 'o 

(4) f" /6 2 sin x dx (5) f* /6 sin 2x dx (6) P cosh x dx 

Jo '0 '0 

(7) T M 3 cos 21/ du (8) P e 2 * dx (9) P e 3X + 1 dx 

Jo J\ Jo 

Now try finding the following indefinite integrals. Remember to include a 
constant of integration in your answers. 

(10) J (f 2 + 2f) dt (11) j sin 3X dx (12) j cos (2* + ir/2) dx 

(13) j sinh 2u du (14) j e 5 * dx (15) j e 2x + 3 dx 

The integrals in the next three questions each represent the distance x 
travelled by a particle moving in a straight line. Find x in each case, using the 
given information to find the value of each of the constants of integration. 

(16) J (8t 3 + t)dt given that x = 2 when t = o 

(17) j sin 2t dt given that x = 1 when t = 0 

(18) j {t + e 2t )dt given that x = 0 when t = o 
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We shall look at what happens in the special case when the acceleration is 
constant in question (2) of Exercise 9.C.2 at the end of Section 9.C.(b). 



To get the maximum worth from the distance/time and velocity/time graphs 
of Figure 9.A.2(a) and (b), we can link them back to Section 8.E.(b) on 
turning points and points of inflection. I show these two graphs again in 
Figure 9.A.6. I have also added the acceleration/time graph and the sketch of 
the actual path of the particle. 

See if you can describe physically what is happening to this particle 
around the instant in time when t = \. Think about what happens to its 
acceleration, its velocity and how the distance it has travelled changes. See 
how this fits in with the point of inflection and the local minimum you can 
see on the two graphs. The answer to this thinking point is given at the back 
of the book after the answers to Exercise 9.A.3. 




o y 3 




o >% 




Here. wVitn t=0 Here when t = l 



© V= 3f-- 2t+X ©0.= = = 6t-: 
^ 



9 



Here, when t- 2 



(£) Tint actual p*CH of ffcc porKcle 



Figure 9.A.6 



9.A.(d) What happens if the area we are finding is below the horizontal axis? 

As an example of this, suppose we have v =f(t) = St 3 - 24t 2 + I6t. This gives a graph like 
the one I show in Figure 9.A.7. 




Figure 9.A.7 
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Integrating, we have 



(8f 3 - 24t 2 + \6t)dt 




o 



2 



and 



(8f 3 - 24t 2 + 16?)* 




St 3 + st 2 } 2 



1 1 



-2. 



(Check this!) 



The second area, which is below the horizontal axis, has worked out as negative. This also 
means that 



What kind of physical meaning could we give to this? 

The graph could represent the velocity/time graph of a particle which moves 2 units away 
from the origin along the positive x-axis in the time interval from t = 0 to t = 1 and then 2 
units back again in the time interval from t = 1 to t = 2. This means that effectively it hasn't 
moved from its starting point over the interval from t = 0 to t = 2, and the minus sign of the 
second integral shows that this part of its journey was in the backwards direction. If it is 
claiming travel expenses, it will have to work out the distance travelled in two separate bits, 
taking the absolute value of each. Doing this gives a total of 2 + | - 2 1 = 4 units of distance 
travelled. 

Any area below the horizontal axis will integrate as negative. 



Sometimes it is possible to get an integral in which the lower limit is a larger number than 
the upper limit, so that in a sense you are looking at the area backwards. You can probably 
see what will happen; the integral will change its sign. 
Using Figure 9.A.6 above as an example, we would have 



You could give a physical meaning to what is happening here by saying that, looking 
backwards over the first time interval from t = 0 to t = 1 , the particle was 2 units nearer to 
the origin at the beginning of this time interval. 

Having the limits swapped in this way is something which actually happens. You will find 
when you do integration by substitution that sometimes the altered limits do have the larger 
number as the lower limit. You can see from the above example that it would then be possible 
to swap them round to the usual position, provided that you also change the sign of the 
integral. 

In general, we can say 




9.A.(e) 



What happens if we change the order of the limits? 




\ b f(t)dt = -\ a f(t)dt. 

J a J b 



As in all these rules, you can replace t by x, or any other letter of your choice. 
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9.A.(f) 



What is / (l/x) dx? 

Since 



— (x") = nx"-\ 
dx 

we know that 

x" + 1 

f x" dx = + C. 

J n+l 

But this rule will not work if n = -1 because we will get a zero underneath the fraction and 
therefore it will be undefined. 

So what is / x -1 dx or / (l/x) dx? 

From Section 8.B.(e), we know that dldx (In x) = l/x. Therefore, reversing the process, 
we have / (l/x) dx = In x + C. 

We can see what is happening graphically by looking at the particular example shown in 
the two drawings of Figure 9. A. 8(a) and (b). Figure 9. A. 8(a) shows In 3 on the graph of 
y = In x and Figure 9 A.8(b) shows In 3 as an area on the graph of y = l/x. 




Figure 9.A.8 



We have 



f 3 1 




— dx = 


In x 


Ji x 





= In 3 - In 1 = In 3 since In 1 = 0. 



i 

In a similar way, we could say 

x 

= lnX- 



[ x 1 




— dx = 


In x 


J 1 x 





In 1 = In X 



for any value of X greater than 1 . It is possible to define the log function from the function 
f(x) = l/x in this way. If this is done, e then comes from the inverse function e x of this log 
function, rather than through considering rates of growth as we did in Section 8.B.(a). We 
extend this rule to cover values of X between one and zero by reversing the limits as we did 
in the previous section. For example, 



In j = 



1/2 



1 



dx = 



1 



dx. 



1/2 
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From Figure 9. A. 8(b), you will see that we can't find any area under the curve of 
f(x) = l/x which involves crossing through the value x = 0 because the graph does an infinite 
jump there. (It has what is called a discontinuity.) 

However, we can find an area which is completely on the left of the origin, like the one 
shown as area B in Figure 9.A.9. Can we then give this area a meaning in terms of logs? 




Figure 9.A.9 



Area B is given by 



r 2 1 




— dx = 


In x 


J_ 3 X 





-2 



We feel a bit tempted to say that area B = ln(-2) - In (-3) except that we know that we can't 
because these logs don't exist. (You can see in Figure 9.A.8(a) that they just aren't there.) 
But, from the symmetry of the graph, we know that 



r 2 1 


f 3 1 




— dx = - 


- — dx = - 


In x 


J_ 3 X 


h x 





(In 3 - In 2). 



This means that we can sidestep the problem of trying to find the undefined ln(-3) and 
ln(-2) by saying 

area B = In |- 2| - In |- 3| = In 2 - In 3 = - area A. 



(Remember that \x\ means 'take the positive value of the x inside the | |'.) 
Notice that this result also fits in with the second rule of logs. 

3 2 

area A = In 3 - In 2 = In (5) and area B = In 2 - In 3 = In (5). 

3 2 

Checking on your calculator will show you that In (j) = -In (3). 
We know from Chapter 8 that 

d a 

— (In (ax + b)) = . 

dx ax + b 
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From this, and the working above, we get the reverse result that 



dx 1 
= - In \ax + b\ + C. 

ax + b a 



It is worth surrounding this box in bright colour since integrals of fractions like this are 
very common. It is safer always to use absolute value signs with log integrals. This prevents 
that horrid nail-biting situation of your calculator telling you ERROR when you try to get 
it to work out a negative log for you. 

exercise 9.A.4 Use the result above to write down the following integrals. (Check by 

differentiating your answer back if you are unsure about any of your numbers.) 

(1) ( — dx (2) \ — dt (3) \^—dx 

(4) l^—dx (5) [ — dt (6) \—^dt 

J 5-2X J 3-f j4 + 5f 

Find these three integrals if possible, giving your answers correct to 3 d.p. 

(7) f — ^— dx (8) f 3 ^— dx (9) [ — ^— dx 
J_ a 5 - 2x } 1 3X - 4 J_ 4 3 + ix 



You might be interested to notice here that the right-hand side of f(x) = l/x is the 
smoothed-out continuous version of the frog-down-the-well or harmonic series of 1 + 5 + 3 
+ 4 + ... of Section 6.F. You can see that the curve of y = l/x could be drawn through the 
tops of the separate jumps which I showed laid out side by side there in Figure 6.F.I. 

We showed in that section that, although the individual terms of this series become 
smaller and smaller the further we go, the sum of the series is infinitely large. If we stack 
the terms one on top of the other, we can reach any height we want to. The frog always 
escapes from the well eventually, however deep down he is to start with. 

In a similar way, it can be shown that the area under the graph from 1 to t becomes slowly 
but inexorably larger and larger, beyond any value we care to think of, as t increases. 

This might seem surprising since the strip giving the increase in area is becoming so thin, 
but the log function of In X = ff (l/x) dt can be made as large as we please by choosing a 
sufficiently large value of X, in just the same way that the sum of the harmonic series can 
be made as large as we please if we take a sufficiently large number of terms. 

You can also see that the log function will continue to increase if you look at Figure 
9.A.8(a). 

Its gradient, given by dldx (In x) = l/x, continues to be positive, although it becomes very 
small indeed as the value of x increases. 



9.B Techniques of integration 

In this section we look at various techniques which will help us to find more complicated 
integrals. When we have done this, we shall be able to apply this knowledge to solve some 
more physical problems. This will then form part of the last section of this chapter. 
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9.B.(a) Making use of what we already know 

When finding integrals the most important rule to remember is that integration and 
differentiation are reverse processes of each other. (In the last section, we looked at examples 
which showed this fundamental rule of calculus happening.) To check if you have integrated 
correctly, you can always differentiate your answer to see if you get what was inside the 
integral sign. (You would need to do this check before you put in any limits.) See how you 
get on doing the following exercise. 



exercise 9-B.i This exercise is in three parts. Part (A) consists of quickie questions on 

differentiation similar to the ones in Section 8.C.(a). 

(A) Differentiate each of the following with respect to the letter used to describe 
the function. 

(i) fit) = 4 r 2 (2) fix) = 3* 3/2 (3) fit) = e 2t (4) fix) = e^ +2 

(5) fit) = (6) fit) = In 3 t ( 7 ) fix) = In ( 3 + x) (8) fix) = In (2 - x) 

(9) fit) = In (3 - 50 (10) fit) = sin 3 r (11) fix) = cos (x + jt/3) 

(12) f(x) = sin iix + jt/6) (13) fit) = cosh (2f + 5) (14) fix) = sinh (8 - sx) 

(15) fix) = sin 3 x (16) fix) = tan x 

Check your answers to these before going on to (B). I think you will probably 
have found them very easy. (If you did have any problems here, it is important to 
go back to Section 8.C before you go any further with integration.) 

Part (B) has quickie questions for you to take back the other way by 
antidifferentiating or integrating. They are close enough to the questions in (A) 
that you should have a good idea of what to do with them. Remember that you 
need to add in a constant to these answers to show that you don't know the shift 
of your curve from the horizontal axis. The need for this is explained in Section 
9 A(a). 

(B) (l) fix) = 2X3 ( 2 ) r(f) = f i/2 (3) fl(f) = e 3t (4) r(x) = e *x+3 

(5) fit) = e" 2f (6) fit) = e 3 " 4t (7) fix) = — (8) fix) = 

x + 2 3x + 2 

(9) fix) = — do) fit) = sin S t (11) fix) = cos (x + jt/6) 
1 -x 

(12) fit) = sin (2f + tt/4) (13) fix) = cosh 3X (14) fix) = sinh (2x + 3) 

(15) fix) = 4 sin 3 x cos x (16) fix) = sec 2 2x 

Check your answers to (B) before going on to (C). 

(C) This has some straightforward integrals with limits (that is, definite integrals), 
for you to work out. If you are not sure what to do here, you need to go back 
to the first section of this chapter. 



(1) f 2 iu 2 + 3u)du (2) [ 3 ( 5 r 2 + i/f)rff (3) f cos4fdf 

Jo '1 'o 

(4) f" /6 sin 2x dx (5) [" /2 sin (2x + jr/3) dx (6) P e 5t dt 

Jo 'o J -1 

(7) f° e 2t+3 dt (8) P sinh2xc/x (9) T M sec 2 x dx 

J-l J-i Jo 

r3 du fl 1 4 rl 1 1 

(10) (11) + ^—dx (12) 

J ° 21/ + 3 J ° X + 2 2X + 5 J "i 2-X 2X + 



9.B Techniques of integration 



383 



Before going on to the next part of this section, we'll collect together the general rules 
we have been using above. 



Some rules for integration 

(1) lff'(x) = x" then f(x) = 



n + 1 



Cif n*-\. 



1 

(2) (a) lff'{x) = - then f(x) = ln|x| + C. 

x 

(b) If/'(x) = — 1 — then f(x) = - In \ax + b\ + C. 

ax + b a 

1 

(3) If f'(x) = e ax then f(x) = - e ax + C. 

a 

-1 

(4) If/'OO = sin {ax + b) then f(x) = — cos (ax + b) + C. 

a 

1 

(5) If f (x) = cos (ax + b) then f(x) = - sin (ax + b) + C. 

a 

1 

(6) If /'(x) = sinh (ax + b) then /(x) = - cosh (ax + b) + C. 

a 

1 

(7) If/'OO = cosh (ax + b) then /(x) = - sinh (ax + b) + C. 

a 

(8) If/'OO = sec 2 x then /(x) = tan x + C. 



See if you can work out what the answers to these three similar looking 
integrals are. Everything you need to be able to do them has come in 
previous chapters. 



(1) 



1 + x : 



dx (2) 



1 +x 



1 1 

dx (3) r dx 



1 -x 2 



9.B.(b) Integration by substitution 

This is just the Chain Rule in reverse. 

In fact, you have already been using it when you wrote down answers like 

J cos 4f rff = J sin 4f + C and j e 2t+3 dt = \e 2t+3 + C. 

Putting in the \ and the \ in these two answers means that you are allowing for the fact that 
At and 2t + 3 are themselves functions of t. 

Just as the Chain Rule makes it possible to differentiate complicated functions of 
functions, so the method of substitution makes it possible for us to integrate more 
complicated functions of functions. 

The easiest way of explaining how this works is to look at some particular examples. 
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EXAMPLE (l) / = j 2x (x 2 + 3) 7 dx 

One way to do this would be to multiply out the bracket using the 
Binomial Theorem, and then integrate term by term, but this would be 
very lengthy and tedious. 

There is something much neater and quicker which we can do. 

If we let t = x 2 + 3, we shall then have 

/ = j 2x t 1 dx. 

(I have used the letter t here because I want to save u and v for later 
purposes.) 

This new integral looks a lot nicer in many ways, but it has a 
peculiar mixture of variables which means that it can't be related back 
to any particular curve. 

^^^^ You can't find an integral which has mixed variables in it. 

So how can we get round this problem here? In particular, we need some way of relating 
the dx which we have to the dt which we would like to have. 

We do this by using our chosen relation between x and t of t = x 2 + 3. From this, we can 
say that dtldx = 2x. So therefore we can replace the 2x dx by dt. 

We now have 

t & (x 2 + 3) 8 

1= t 7 dt = — + C = + C, 

J 8 8 

replacing the original variable of x in the final answer. You should always do this unless the 
integral has limits. The next example will show you what to do in this case. 

Notice how fortunate we were in the above example to have exactly 2x dx\ This was not 
by chance - 1 wanted an easy example to start with. When you have done a number of integrals 
like this, you get used to looking for this kind of helpful situation. When it happens, you will 
often find that you can work out what the answer must be in your head but it's wise to check 
back by differentiating; it is horribly easy to get the arithmetic slightly wrong otherwise. 

. i 3x 

EXAMPLE (2) / = dx 

J o Vx 2 + 1 

We can use a very similar process here, to get rid of the problem of 
the complicated square root underneath. 

If we put t = x 2 + 1 then dtldx = 2x. Therefore we can replace x dx 
by 2 dt. (Notice that in this example too we have an x exactly where we 
need it.) 

This then gives us 
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We must now change the limits of the integral so that they fit the new 
variable, t. 

If x = 0 then t = 1, and if x = 1 then t = 2, using t = x 2 + 1. 
So now we have 

/=J i 2 |r 1/2 ^=[3f 1/2 ]^ = 3V2-3. 

(You can replace t by x 2 + 1 and then use the original limits if you 
prefer, but doing this is generally more messy and time-consuming.) 



x 

EXAMPLE (3) /= I dx. 

(x + 5) 



If we let t = x + 5, we have dtldx = 1 so we can replace dx by dt. 
But this will leave us with an x sitting on the top. What can we do with 
it so that the whole integral is in terms of ft 



We can say t = x + 5 so x = t - 5. Doing this gives us 
/= [ { LZlL dt= r r 8 - 5r 9 dt 



r 7 5r 8 
— + — + c 

7 8 

-1 5 

+ + C 



7(x + 5) 7 8(x + 5) 8 

which will tidy up as 

-8(x + 5) + 35 -8x - 5 

C = + C. 



56(x + 5) 8 56(x + 5) 8 

From the examples above, we can make these general rules. 



Integration by substitution 

(1) Replace the nastiest bit by something simple, so that you are working with a 
new function of your own choosing. 

(2) Find the relation between the old dx and the new dt (or whatever letters you 
are using). If this turns out to be rather complicated, you may have to 
rethink step (1). 

(3) Make sure that you rewrite your new integral so that it is completely in 
terms of your new variable. (No stray left-behind xs!) 

(4) Change the limits to fit your new variable if it's a definite integral. 

(5) Integrate! 

(6) Replace the old variable if this was an indefinite integral. 



The common mistake that students make when they write, for example, / sin 3 x dx = 
4 sin 4 x + C shows that their minds are working on the right track. They just haven't gone 
quite far enough. 
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What they are finding is the equivalent of J (lump) 3 d(lump) = |(lump) 4 + C when what 
they've actually got is /(lump) 3 dx. 



exercise 9.B.2 Try these three for yourself now. 



W I , T s. dx W f 1= <** (3) i xU-tfdx 



4 



2x + 6 

EXAMPLE (4) /= I dx 

x 2 + 6x + 25 



This integral looks quite difficult but it is actually very simple to do. 
Can you spot why this is? 



If we differentiate the bottom, we get the top. 

This means that we can write down the answer straight away. 

It must be In |x 2 + 6x + 25 1 + C. Check that this answer works by 
differentiating it back. 

It is also possible to find this answer by substituting u = x 2 + 6x + 
25. This gives 

f du 

duldx = 2x + 6 so / = — = In \u\ + C = In |x 2 + 6x + 25 1 + C 
J u 

but the first method is much simpler once you know that it is worth 
looking out for. 

Sometimes we need to do a bit of juggling with the numbers, so for example, 

4x + 12 f 2x + 6 

dx = 2 dx = 2 In |x 2 + 6x + 25| + C 



x 2 + 6x + 25 J x 2 + 6x + 25 

f 2x + 7 

but dx is not yet possible for us to do. 

J x 2 + 6x + 25 

We shall find out how to do this one in Example (1) of Section 9.B.(d). 
Meanwhile, we have this useful rule: 



if (x) 

if / = I dx then / = In [f(x)\ + C. 

fix) 



We can now also do the first of the three integrals in the thinking point at the end of 
Section 9.B.(a). 
This was 

x 

dx 



1 +x 2 

and the answer is 

1 



In |1 +x 2 | + C. 
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example (5) A special property of 



Suppose we have 



dx. 



dx 



dx. 



We can see from Figure 9.B.1 that this leaves us with (\lx)dx. 



2" 




Figure 9.B.1 



But from Section 9.A.(f) we know that 

6 1 f 2 1 

— dx = In 6 and — dx = In 2. 



Also, 



In 6 - In 2 = In | ^ j = In 3 = j -dr. 



This means that we have found the two different answers of (l/x)dx 
and J" ^ (l/x)dx for the spotty area. What has gone wrong? The answer is 
that nothing has. If we let x = 2t m (1/x) dx, we get 



It 



X2 \ dt 



dt = In 3. 



The letter used here makes no difference to the final answer because it 
disappears when we put in the limits. We have discovered that this area 
can be pushed around without changing its size, like toothpaste in a 
tube. I've shown this in Figure 9.B.2. 

Multiplying or dividing the limits by any constant number will make no 
difference to the resulting area because the effects cancel each other out. 




>3C 
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exercise 9.B.3 Now try these for yourself. You may find that you can do some of these questions 

in your head, just as it not necessary to write down all the details every time you 
use the Chain Rule. It is excellent if you can do this, but you should always check 
by differentiating back that you have found the right answer. 

Always include the dx or du or whatever when you write down an integral. If 
you don't do this you will make mistakes when you substitute a new variable. 

f 2x + 3 f 6x + 2 f lx + 8 
(1) I — dx (2) I — ; dx (3) I dx 



x' + 3X + 9 J ye 1 + 2X + 7 + + 7 

f x dx fx 2 f x 

w J(iF7^ (5) te d * (6, JCT* 

r , f 1 + IX f dx 

(7) JxVi-x 2 dx (8) \-==dx (9) 

J \X + X 



(2X-7) 2 

9.B.(c) A selection of trig integrals with some hyperbolic cousins 

Integrals of trig functions are often involved in solutions to physical problems, so this 
section deals with some of the ones which you may meet. 
We start with a very easy one. 

EXAMPLE (l) / = J COS X shl 6 X dx 

This integral is similar to the ones in the previous section. The 
presence of the cos x means that you can probably write the answer 
straight down. Try doing this. 



In case you couldn't, here's how the integration works by 
substitution. We put t = sin x so dtldx = cos x. We then get 



EXAMPLE 



1 = / t 6 dt = \ f + C = 7 sin 7 x + C. 

- r sin x 
(2) I = J tan x dx = dx 

J COS X 



We can now find the integral of tan x using the same method as in 
Example (1). 

If we let t = cos x then we have dtldx = - sin x, so 
dt 



I = -j — = - In |/| + C = - In |cosx| + C. 
1 

Because sec x = , we can also say 

cos X 

In |sec x| = ln(l) - In |cos x| = - In |cos x|. 
This gives the two forms of 



J tan x dx = In |sec x| + C = - In |cos x| + C. 



This is another result worth surrounding in bright colour. 
9.B Techniques of integration 
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(-jt/3 2 sin x 

EXAMPLE (3) /= I dx 

0 (1 + cos x) 



This one is very easy, because dldx{\ + cos x) = - sin x, so we can 
find it by putting t = 1 + cos x. Try finishing the working off for 
yourself. 



We can replace sin x dx by -dt. 

3 

Also, if x = Jt/3 then t = 5, and if x = 0 then f = 2. This gives 



2 


f 2 2 


1 " 


2 1 


4 


7 


— dt = 












i 3 


■> 3/2 * 




3/2 4 


9 


36 



This is an example of an integral where the limits get swapped over in order of size when 
we do a substitution. We then use the result of Section 9.A.(e) to write them in the usual 
order. 



example (4) Finding / sin 2 x dx and Jsin 3 2x dx. 

I have chosen these two to show you, but the methods we will use on 
them will carry over with only slight adjustments if we replace sin by 
cos or sinh or cosh. 

Both /1 = / sin 2 x dx and I 2 = /sin 3 2x dx are quite straightforward. 

For I x = / sin 2 x dx, we use the identity cos 2x = 1 - 2 sin 2 x. We 
showed this in Section 5.D.(d). Using it gives us sin x = 5 (1 - cos 2x). 
Therefore I x = \ j 1 - cos 2x dx = \x - \ sin 2x + C. 

To find I 2 = /sin 3 2x dx, we cunningly rewrite it to make it into two 
easy integrals. We say 

I 2 = Jsin 2x(l - cos 2 2x)dx = Jsin 2x - sin 2x cos 2 2x dx 

= -\ cos 2x + \ cos 3 2x + C. 

Differentiate this back to see that it is right. 

It isn't possible for me to put in all the trig integrals I want to show you here, because 
some of them depend on future work, so I'll give you a list now of where to find these. 

Where to look for other trig and hyperbolic integrals 

• In Section 9.B.(g) we shall discover how we can get rules to find integrals like / sin" x dx 
where n is a positive whole number. 

• In Section 9.B.(f) we shall discover how to find integrals involving multiplications like 
/ sin 2x e 3x dx, and also what to do with integrals like /sin 4x cos 2x dx. 

• In Section 9.B.(d) we shall find out how we can use the inverse trig and hyperbolic 
functions to give us answers to another collection of integrals. 

• In Section 9.B.(h) we shall discover how to find integrals like 

f 1 

dx 

J 1 + cos x 
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which is surprisingly tricky, unlike 

sin x 

dx 



1 + COS X 

which you can do in your head (yes?) 



EXERCISE 9.B.4 



It is - In (1 + cos x) + C. 



Try these for yourself now. As usual, try to do the easy ones in your head, but 
differentiate back to check the arithmetic. 



, sin 3X 
cos 4 3X 



(1) /sin x cos 5 x dx (2) /sin 2x cos 3 ix dx (3) /cos(x + jr/4) sin 2 (x + jr/4) dx 

f cos x , f cos x 

(5) 7 :— ( 6) |cotxdx= dx 

J (1 + sin x) 3 J J sin x 

(7) Jcos 2 xdx (8) Jsin 2 3xdx (9) Jcos 3 xdx 

(10) J cos 3 5X dx (11) J cosh 2 x dx (12) Jsinh (x + 1) cosh 4 (x + 1) dx 

For the next two questions, if you put t = sin x in (13) and t = cosh x in (14), you 
will turn them into something you can do using partial fractions. 

f^ 6 3 cos x f 7 sinh x 
(13) ~^r^ : 7 dx (14) I dx 



sin 2 x + 5 sin x + 6 



2 cosh 2 x + 5 cosh x - 3 

1/2 



r dx r i/2 . 

(15) J (Put x = 2 sin f.) (16) y](i-x 2 )dx (Putx = sinr.) 

Jo V4-x 2 J o 



(17) 



lljl 



cos X 



(4 + sin x) 2 



dx 



(18) 



dx (Put x = 2 cos f.) 



9.B.(d) Integrals which use inverse trig and hyperbolic functions 

In Section 8.D.(i) and Section 8.F.(c) we found some results which will be very useful to us 
now for finding certain kinds of integral. I have collected the most important of them 
together for you in the box below. 



Four useful rules about integrals 



(a) 
(c) 



dx 



{a^x 1 

dx 
Vx 2 + a 2 



= sin 



-1 



sinh 



-1 



(\x\<a) (b) 



(d) 



dx 



x 2 + a 2 a 



tan 



-1 



dx 



Jx^a 1 



cosh 



-1 



(x > a) 



Note (1) We take the value of a to be positive. If you put a = 1, you will get the 
simplest case of the integral. 
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Note (2) I have not included a constant of integration for any of these integrals 
both for reasons of space and because you may be using these results to 
find integrals with given limits. 

Note (3) Notice the restrictions on the values of x in (a) and (d). We must have 
these to avoid problems like wanting the square roots of negative 
numbers, trying to find angles whose sin is greater than one, or trying 
to find values of x whose cosh is less than 1 . 



Just as when we did the differentiation in Sections 8.D.(i) and 8.F.(c), we can match up 
the as in the box above to fit particular circumstances. For example, we now have the answer 
to the second part of the thinking point at the end of Section 9.B.(a). 

Putting a = 1 in (b) gives 

f dx 

= tan -1 x. 

J x 2 + 1 

The rest of this thinking point is answered in the next section. 



exercise 9.B.5 Try the following three yourself, by picking out the right integral and choosing the 

value of a which will make it work. 

f 3 dx f 2 dx f 4 dx 

to A W ^=f (3) 



V* 2 + 25 J V9 -x 2 J 0 x 2 + 16 

du 

a special warning example What happens if we have / = I - ? 



V9 - 4u 2 ' 

Can you write down the answer for this one? Be careful! 
There is something about this integral which makes it not 
entirely straightforward. 



There are two possible approaches to doing this one. 

The first is to use substitution, putting u = 5 sin 9, to simplify the square-rooted 
expression. 

3 

This gives us du/dv = 5 cos 9, and 



, f cos 9 , f cos 9 1 

1=1 d6 = \ dd = \d + C. 

J V9 - 9 sin 2 9 J 3 cos 9 

Now fw^sin© so 9 = smT l (\u). This gives us /= \ suT 1 + C. 

Did you include the 5 in your answer? This is what makes this integral not quite 
straightforward. (Excellent, if you got it!) 

The other way of approaching this integral is to use Rule (a) above. 

In this case, there are again two things which you can do. 

The first is to convert the integral into the same form as (a) by rearranging it so it has u 2 
instead of 4m 2 underneath. 
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This is probably the simplest approach and we do it as follows: 



/ = 



du f du j f du 

2 



V9 - 4u 2 J 2 ^u 2 - 1 ^ 



Now a 2 = f so a = \ (since a is positive). This gives /= \ sin -1 (| w) + C. 

Alternatively, you can put x = 2m and a = 3. If you do this, you have to remember 
that putting x = 2u will give dxldu = 2 and so du must be replaced by \ dx. 

Doing this gives the same answer yet again of / = 5 sin - 1 (3 w) + C. 

Notice, too, that I've made the integral easier to see by taking the constant multiplying 
number outside it. We can do this because it is just a scaling factor for the area if the integral 
has limits. (You can see this happening in questions (3) and (4) of exercise 9.A.3.) 



Never make an integral look nicer by taking a variable like an x outside. It may 
look better but it means something completely different. 



Three examples which use the technique of completing the square 

f 2x + 7 

example (1) In Section 9.B.(b), we briefly met the integral / = dx. 

J x 2 + 6x + 25 

We could not do this then, but we did find that 
2x + 6 

dx = lnlx 2 + 6x + 25 1 + C. 

x 2 + 6x + 25 

We therefore split / to make use of this, writing 

f 2x + 6 f 1 

/ = l x + I 2 = dx + dx. 

] x 2 + 6x + 25 J x 2 + 6x + 25 

We then make it possible to use the inverse tan rule on I 2 by cunningly 
completing the square underneath. 
We say 

x 2 + 6x + 25 = (x + 3) 2 - 9 + 25 = (x + 3) 2 + 16. 
So we have 



dx j / x + 3 

/, = I = 4 tan- 1 ! I + C 

(x + 3) 2 + 4 2 



using Rule (b) for the inverse tan from the last section with 'x' = x + 3 
and 'a' = 4. 
Therefore 



1 (x + 3 \ 
/= ln|x 2 + 6x + 25 1 + 5 tan" 1 \ —- j + c - 
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example (2) Sometimes you need to do some juggling with the numbers to make the 
rules fit the particular circumstances. For example, suppose you have 

5* + 3 

dt. 



t 2 + 10* + 29 



Differentiating the bottom with respect to * gives 2* + 10, so we 
rearrange the top to take advantage of a log integral getting rid of the * 
for us. We can do this by saying 

, f 10* + 6 , f 10* + 50 - 44 

7 = 5 ~ 7 dt = \\ dt 

J t 2 + 10* + 29 J * 2 + 10* + 29 



2* + 10 f 22 

dt - dt 



t 2 + 10* + 29 J f 2 + 10* + 29 

dt 



= | ln|* 2 + 10* + 29|-22 J 



(* + 5) 



2 



Now, we do the second integral using Rule (b), and letting 
x = * + 5 and a = 2. This gives us 

/ = f In |f 2 + 10* + 29| - 1 1 tan" 1 ( ^— ) + C. 



example (3) Sometimes we have to be careful about applying Rule (b) when we have 
completed the square, just as in the special warning example which we 
had earlier. Suppose we have 

3dt f 3dt 

1 = 



4t z + 12* + 25 J (2* + 3) 2 + 16 

We put a = 4 and x = 2* + 3. Doing this substitution gives dxldt = 2 so 
we must replace dt by \ dx here. / now becomes 

dx = I X 3 tan" 1 | - | + C = I tan" 1 | — — - ] + C. 



x 2 +16 \4 

If you do this replacement step in your head you must remember to 
include the extra \ which comes from replacing 2* + 3 by x. 
Alternatively, you can say 

3dt , f 3dt 



At 2 + 12* + 25 4 J * 2 + 3* + f 



! I ^ . 9 , 25 . 

since -j + t = 4. 



4 



(f + |) 2 + 4 



3 

Now, using Rule (b) with x = t + 5 and a = 2 we get 

2* + 3\ 3 ,/2* + 3 
/ = 4 X 5 X tan" 1 I I = 5 tan" 1 



This method involves more fractions but means you can use the tan 1 rule 
directly, without having to make adjustments between the dt and the dx. 
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EXERCISE 9-B.6 



Now try finding the following integrals yourself. 
dt f dx 



(i) 
(4) 
(?) 



9 + t 2 

¥ + 3 
t 2 + 8t + 17 

3U + 2 
u 2 + 6u + 13 



(2) 

dt (5) 
c/u (8) 



x 2 + 6x + 10 
dx 



4* + 9 
3t/u 



9U 2 + 24U + 17 



(3) 
(6) 
(9) 



2X + 5 



t/x 



x 2 + 4X + 13 
3t/x 



9X 2 + 16 

3du 
2U 2 + 2U + 5 



9.B.(e) Using partial fractions in integration 

Being able to write single complicated fractions in the form of two or more much simpler 
fractions makes integration much easier. I have described how we can do this in Section 6.E. 
If you are at all unsure about how to find partial fractions, you should go back there now and 
check it out before going on. 

Here are three examples showing their use in integration. 



EXAMPLE (l) 



10 



dx 



x(x - l)(x + 4) 



In question (3) of Exercise 6.E.1 we found that 



10 



so 



= — + + 

x(x - l)(x + 4) x x - I x + 4 

'-- 2 

-* +- 
x x 



+ 



1 x + 4, 



dx 



= -f In \x\ + 2 In \x - l| + \ In \x + 4| + C. 



It is much easier to do the integration if you keep the number fractions on 
the top as I have done, and think of them as scaling factors multiplying each 
answer. You can see how this works in the answers to questions (3) and (4) 
of Exercise 9.A.3. 



example (2) This will give you the answer to (3) of the thinking point at the end of 
Section 9.B.(a). I asked you there if you could find 




You might know now what to do even if you didn't then. If so, try it 
now before looking at what I have done. 



We can use partial fractions to say that 

1 1 



[ — l —dx= [( — ^ — + — ^ — ] c/x = 5 In 1 1 +x|-|ln|l-x| + C. 
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You might also have realised that you could use the result that 
d 1 

— (tanh 1 x) = (this was question (3) of Exercise 8.D.3) 

dx 1 - x 

so you could say that / = tanh -1 x + C. 

Does this mean that we've got two different answers? 

No, because if we use the laws of logs on the first answer, we get 

/ 1 +x 
and we know that 



1 +x\ l+x 



tanh 1 x = j In = In 

\ 1 -xj V 1 -x 

from Section 8.D.(g). 

When we combine the logs together to give a single log in this way, 
it is often convenient to include the constant term in this log too. We 
can do this by letting C = In A. This then gives us the answer that 



1 = In \A- 



1 + x 
1 -x 



We shall find that this kind of combination of an answer into a single log will be particularly 
useful when we solve some differential equations in Section 9.C. 

example (3) What will happen if the partial fractions which we find involve the 
special cases of Section 6.E? Will it still be easy to integrate the 
answer? The following integral has examples of two of these special 
cases in its second and third fractions. 

ff 2 1 2x + 3\ 

J \x - 1 (x- l) 2 x 2 + 4J 

Try writing down the answer to it yourself. 



Three separate integrals make up the answer here. The first two are 

[2 ( 1 

Ii = dx = 2 In \x - l| and I 2 = \(x - 1) dx = . 

J x - 1 " J x - 1 



1 2 isn't a log! 



/ 3 is a little bit tricky, but we know how to do it from Section 
9.B.(d). We have 

r 2x + 3 f 2x f 3 

L = dx = dx + dx 

) x 2 + 4 J x 2 + 4 J x 2 + 4 

= ln |x 2 + 4| + |tan~ 1 (x/2). 
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This gives a complete answer of 



/ = In [(x - l) 2 (x 2 + 4)] - — *-y + | tan" 1 ( * ) + C. 



We know from looking at it that the inside of the log above must be 
positive. 

example (4) Suppose we are given the three similar-looking integrals below. 

f 2x f 2x + 3 f dx 

h = 7 dx I 2 = \ — — dx I 3 



x 2 + 5x + 6 J x 2 + 3x + 4 j x 2 + 8x + 25 

Try finding each of these for yourself before looking at what I have 
done with them. 



f 6 4 

I x = dx = 6 In |x + 3 1 - 4 In |x + 2| + C. 

J x + 3 x + 2 

To find I 2 , we notice that x 2 + 3x + 4 will not factorise. Indeed, it has no 
real factors. But dldx (x 2 + 3x + 4) = 2x + 3. So I 2 = In |x 2 + 3x + 4 1 + C. 

Now for 7 3 . We can't factorise x 2 + 8x + 25, and there is no handy 2x + 8 
on the top, so it isn't a straightforward log. This is an integral which uses 
the methods of the last section. 

Putting 'x' = x + 4 and 'a' = 3 and using Rule (b) from there, we get 

dx , , / x + 4 v 

(x + 4) 2 + 9 



h = I ; = han- ! | | + C. 



exercise 9.B.7 Try the following for yourself now. 

d)f 7 £ rdx (2) f- -dx (3) f— dx 

J (X + 2)(X + 3) J (X - 2)(X + 3) 2 J X + 1 

(4) ( , 2 4 tfy (5) ( - 3f J a (6) [- ^ — -rfy 

J y(K 2 + 1) J (2f-i)(f + 2 ) 2 J (k- i)(k 2 + 9) 

f 10X f f 2 + 1 f X 2 + 1 

(7) (8) dt (9) rfx 

J (x-i)(x 2 - 9 ) J r 2 -i J (x+2)(x + 4) 

r 1 r o 2 -i r e x 

(lo) "1 2 ^ (") ^7 r du (12) — 1 — dx 

J y y J u (2U + 1) J e 2x + se x + 6 

9.B.(f) Integration by parts 

Suppose that you need to find / x cos 3x dx. 

If instead you had wanted to find J (x + cos 3x) dx, you would just have integrated each 
bit in turn, giving you the answer of 1 x 2 + 3 sin 3x + C. Is it true that 

I = jx cos 3x dx = (\ x 2 )(| sin 3x) + C = \ x 2 sin 3x + CI 

Try differentiating this back and see what you get. 
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At this point you should have decided that this method won't work. 

We have to use the Product Rule to differentiate i% 2 sin 3x, and doing this gives us 

— (| x 2 sin 3x) = | x sin 3x + \ x 2 cos 3x. 
dx 

(If you are at all unsure about this last step, you need to go back to Section 8.C.(d) 
urgently before going on any further with this section.) 

To do integrals which are made up of two functions multiplied together, like the one 
above, we use the Product Rule turned backwards. We used the Chain Rule turned 
backwards, in a very similar way, to let us do integration by substitution. 

In Section 8.C.(d), we wrote down the Product Rule in the form 

d du dv 

— {uv ) = v — + u — 
dx dx dx 

where u and v are both themselves functions of x. 

Integrating both sides with respect to x, and remembering that integration is the reverse 
process to differentiation, gives us 

f du f dv 
uv = \v — dx + \u — dx. 
J dx J dx 

We then rearrange this so that we get 



The rule for integration by parts 

dv f du 

u — dx = uv - \v — dx. 
dx J dx 

This is also sometimes written in the form 

juv' dx = uv - jvu' dx. 



^^^^ As we have just seen, it isn't true that you can find / uv dx by integrating 
^B-^V each of u and v and then multiplying these answers together (though 
students quite often do this). 



I will now show you how the above rule works by taking some examples. 

example (1) We'll start with the correct version of / = / x cos 3x dx. 

Here, we can easily differentiate and integrate both x and cos 3x, but 
differentiating x looks promising since it gives us something simpler. 
Integrating x would gives us a more complicated integral to find 
involving 5 x 2 . Therefore we let 

du dv y 

u = x so — =1 and — = cos 3x so v = 3 sin 3x. 
dx dx 
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You should always write down these bits of working because it helps 
you not to make arithmetical slips, and it makes it easier to check back 
if something should go wrong. You don't need to worry about putting in 
a constant when you integrate dvldx; all we will need is a constant of 
integration in the final answer if the given integral has no limits, and so 
we are not finding a definite area. 

Next, we use this working to substitute into the rule above. Doing 
this gives us 

I = jx cos 3x dx = | x sin 3x - j \ sin 3x dx 

and we see that we've now got a nice easy integral to do, so giving us 
the final answer of 

/ = | x sin 3x + \ cos 3x + C. 

Differentiate this answer back for yourself to check that it really does 
work. 

If we had started with / = / x 2 cos 3x dx, we would have done 
exactly the same sort of thing, with u = x 2 and dvldx = cos 3x, and then 
used the method of integration by parts twice. 

example (2) A tricky pair. Suppose we have I x = fxe x dx and I 2 = fxe* 2 dx. 
Have a go at doing each of these yourself. 



We can use integration by parts for 7\ by putting u = x so du/dx = 1 
and dvldx = e x so v = e x . Doing this gives us 

7i = xe x - je x dx = xe x - e x + C. 

When we come to try I 2 , we can't use integration by parts because we 

2 

can't integrate e x . (If you thought that you could try differentiating 
back!) If, on the other hand, we make x = dvldx, the integral we have to 
find gets worse instead of better. 

The answer here is to do the substitution x 2 = t so that 
dtldx = 2x and x dx can be replaced by \ dt. Doing this gives us 

I 2 = \ je'dt = \e* + C = \e x2 + C. 

You might perhaps have spotted that this answer will work just by looking at the integral. 
This example shows us that not every integral which is made up of two functions multiplied 
together will respond to being attacked by the method of integration by parts. 

EXAMPLE (3) I=jx 3 lnxdx 

Integrating In x is tricky (we shall see how to do it in the next example), 
but differentiating it is easy. Also, although getting a higher power of x 
often makes an integral worse, in this particular case it won't cause us a 
problem because we also have a log. 
We let 

du 1 dv x 4 

u = In x so — = — and — = x so v = — . 

dx x dx 4 
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This gives us 



/ = jx 4 In x - I ( — x 4 X — ] dx 



(I have used X to show the multiplication inside the integral.) 
= \ x 4 In x - \ jx 3 dx = \ x 4 In x - ^ x 4 + C. 



example (4) Now we can find I = jinx dx. 



To do this, we use a very useful trick which is to think of In x as being 
1 X In x. 

Then 7 = / 1 X In x dx, and we put u = In x 
so duldx = l/x and dvldx =1 so v = x. 
This then gives us 

r r 1 

I = \ln x dx = x In x - \x X — dx = x \n x - x + C. 

1 1 x 

example (5) This is another example which uses the same cunning trick. 

If we want 

■ 1/2 

tan -1 (2x) dx 

Jo 

then we write it as 

■ 1/2 

1 X tan -1 (2x) dx. 

Now we let u = tan _1 (2x) so 

du 2 dv 

— = and — =1 so v = x. 

dx 1 + 4x dx 

(This step uses the work of Section 8.D.(i).) 
This gives 

■ 1/2 2x 

dx 



I=\x tan" 1 (2x) - 

L Jo Jo 1 + 



4x 2 

= \(jtI4) -\\n\ \ +4x 2 |] llL = jt/8 - \ In 2. 



All the inverse trig and hyperbolic functions can be integrated by using 
this method. 



example (6) Now, suppose we want j5 x dx. 



Students often find this integral rather awkward to do. The cunning trick 
which we used above won't work because it actually makes things 
worse, so what can we do? We can think that 5 X behaves very much (but 
not quite) like e x , so we look what happens if we differentiate it instead. 

If y = 5 X then In y = In (5 X ) = x In 5 so (\/y)dy/dx = In 5 
differentiating implicitly, like we did in Section 8.F.(d). 
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Therefore, if y = 5 X , we have 
dy 

— = yln 5 = (In 5)5 X . 
dx 

Therefore 

|5 X <& = (-j-j) 5 " + C 
since integration is the reverse process to differentiation. 

example (7) j e 3u sin 2w du 

The safest first step here is to replace u by x so that you don't get 
into a horrid confusion of us which mean two different things. Since 
this integral has no limits, we shall have to replace x by the given 
variable u as our last step. If an integral does have limits, so that it is 
finding a particular area, you can rewrite it using any letter of your 
choice - the answer will come out exactly the same. For this reason, the 
letter that we use is called a dummy variable. This worked in exactly 
the same way when we used 2 in Section 6.D.(a). 

We start, then, by rewriting / as 

j e 3x sin 2x dx. 

Both of e 3x and sin 2x are easy to differentiate and integrate, and it 
doesn't matter which way round you choose your u and dvldx here. 

I will put u = e 3x so duldx = 3e 3x and dvldx = sin 2x so 
v = — j cos 2x. This then gives me 

I = -\ e 3x cos 2x + j \e 3x cos 2x dx. 

At this stage, you may think that things are no better, since we have a 
very similar integral to find to the one which we started with. However, 
if we repeat the process of integration by parts on this new integral, we 
will find that everything will work out very nicely. 



^■^^ It is very important at this stage to stick to the same sort of choice that we 
^B-^V made at the beginning. If we don't do this, the whole thing unravels like 
^* knitting and we finish up with / = / which, though true, is not very helpful. 



This means that I must put u = e 3x so duldx = 3e 3x and 
dvldx = cos 2x so v = \ sin 2x. This then gives me 

j e 3x cos 2x dx = 5 e 3x sin 2x - g3x srn 2x dx. 

Substituting this back in the original equation gives 

I = -\ e 3x cos 2x + I ^5 e 3x sin 2x - § j e 3x sin 2x dx j. 

Again you may think that we are not getting anywhere at all as we now 
have the same integral back again that we started with. But, if we call 
this /, you will see that we are in a good position after all. 
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We have 

I = -\ e 3x cos 2x + | e 3x sin 2x - \ I 
and rearranging gives us 

^ / = \ e 3x (-2 cos 2x + 3 sin 2x). 
Finally, we must replace x by the original u. Doing this gives us 

/ = Yi e 3u (3 sin 2u - 2 cos 2u) + C 
putting in a constant of integration. 



example (8) J sin 4x cos 2x dx 

We can do this in the same way as the last example by using integration 
by parts, but it is very much quicker (and safer from arithmetical slips) 
to use the rule 

sin A cos B = j [sin (A + B) + sin (A - B)] 

from Section 5.D.(h). 

Here, A = Ax and B = 2x so we get j (sin 6x + sin 2x) giving 

/ = \ j sin 6x + sin 2x dx = \(- \ cos 6x - \ cos 2x) + C 

= -j2 cos 6x - \ cos 2x + C. 



exercise 9.B.8 Students quite often make slips with the little bits of differentiating and 

integrating which come in the working steps for integration by parts, so this first 
question starts you off with some extra practice on this. 

(1) For each of the following, write down either du/dx (or du/dt if the function is 
in terms of t), or v as required. 

(a) u = e 5 * (b) u = sin u,t (c) dv/dx = (d) dv/dt = e~ l 

(e) u = In (2x) (f) dv/dt = sinh 2f (g) dv/dx = sin 3X (h) u = cos 2x 

Check your answers to this question before going on to integrate the following. 

(2) /2f sin 3f dt (3) fxe* dx (4) fx 2 sin x dx 

(5) (a) fx 2 In x dx (b) f 1 - In x dx (c) Jx 2 (In x) 2 dx 

(6) x 2 tan" 1 x dx (7) e~ x sin x dx (8) JV f cosh 2f df 
(9) /sin zjx sin 3X dx (10) /cos 3f cos 6r dt 



9.B.(g) Finding rules for doing integrals like /„ = / sin" x dx 

What we want to do here is to find a pattern which will take us down from /„ to I n _ x or 
possibly to I„_ 2 . (Which of these we get will depend on the particular integral we are 
considering.) If we then find I 0 and I x , we shall be able to find /„ for any positive whole 
number value of n. 

I will show you how this actually works by taking some examples. 
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r Jt/2 

example (1) If /„ = J sin" x dx then answer the following. 

(a) Find I 0 and I x . 

(b) Find a rule which links I n \o I n _ 2 - 

(c) Use (a) and (b) to find 4> 4> / 5 and I 6 . 

(a) Since (sin x)° = 1 , we have 

f ji/2 r -iJr/2 

In = \ ax = \ x\ = Till 

Jo L J 0 

rTtil r -| JT/2 

/, = sin x dx = - cos x\ = 1 . 
Jo L J 0 



(b) Now we use integration by parts on /„ because this will make it 
possible to find a pattern which will take us down from the starting 
number of n. This means that we are using the work of the previous 
section. 

First, we have to decide on the best way to split the integral into 
two parts, remembering that we must have one part that we can 
integrate easily. 

Here, the most promising idea seems to be to write 

I n = J sin x dx = J sin x X sin x dx. 

(It would not be a good idea to choose J^ 2 1 X sin" x dx because 
integrating the 1 would give us an x which would just be an 
embarrasment.) 

Now we let dvldx = sin x so v = - cos x and u = sin" - 1 x so 
du 

— = (n - 1) sin" -2 x(cos x). 
dx 

Using the rule for integration by parts gives us 

[, -| Jt/2 r Till ^ 

-cosxsin x - (-cosx)((« - 1) sin" x (cosx)Wx. 
Jo Jo 

Now, 

[, -| jt/2 
-cos x sin" - xj = 0. 

This gives us 

4 = (n - 1) \ 7112 cos 2 x sin" -2 x dx. 

We are interested in sines here, so we use the identity cos 2 x = 
1 - sin 2 x. This gives us 

I„ = (n - 1) [ j™' 2 sin" -2 x dx - j*' 2 sin" x dx]. 

But this is the same as 

I„ = (n- 1) 4-2 - in - 
Therefore 

4 + (n-i)4 = («-i)/„_ 2 so i n = \ r —\i n -2- 
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We now have the rule which gives us the pattern of how the 
integrals run down from n. A rule like this is called a reduction 
formula. Notice that this particular rule is going down in double 
jumps from n to n - 2. This means that we need to know both I 0 
and Ii to be able to use it for all values of n. 

(c) Using the rule we have just found and the answers from (a), we can 
say that 

1 3 3 

h = ih = and I 4 = 5 I 2 = 4 X jt/4 = 3jt/16 and 

r_2,_2 , r_4 J ._4 2__8_ 

i 3 - 3 ii - 3 ana i 5 - 5 1 3 - 5 x 3 - 15 . 

f e 

EXAMPLE (2) \fl n =\ ln n xdx, answer the following. 

(a) Find I 0 . 

(b) Find a rule to write /„ in terms of /„ _ 1 . 

(c) Use this rule to find /j , I 2 and / 3 . 

(a) h = \[ dx= [x]* = e- 1. 

(b) This time it looks best to split the integral as 

4 = j e 1 X In" x <ix 

since getting an x from integrating the 1 will be no problem. If we 
do this, we are working in a very similar way to Example (4) in 
Section 9.B.(f). We will have 

dv 

— =1 so v = x and u = In" x 

dx 

du I 1 \ n 

so — = (nln"- 1 x) - N-ln"- 1 *. 
dx \ x j x 

Therefore 

/„ = fx In" xX - [ e x X - In"" 1 x dx. 
L J 1 J i jc 

Now In e = 1 and In 1 = 0, so we have 
I„ = e-n I n _ x . 

This gives us the pattern of how the integrals run down from n. 
Notice that it is just single jumps this time, so we only need to 
know the value of I 0 to use it. 

(c) We now find I x , I 2 and 7 3 . 

11 = e - 1 0 = e - (e - 1) = 1 

1 2 = e - 2I\ = e - 2 

1 3 = e-3I 2 = e-3(e-2) = 6-2e 



404 



Integration 



example (3) Find a reduction formula for 

/■jr/4 

/„ = J tan" x dx for n > 2 
and use it to evaluate I 2 , I3 and I 4 . 

This time we have to be a bit ingenious with our choice of split. Using 
jjt/4 j x tan" xdx will not work because it will give us an awkward x 
to deal with. (We want an answer involving tans.) 

Using J*q /4 tan x X tan" ~ 1 x dx doesn't look very good either since 
/tan xdx = -In (cos x) which will make things much more complicated. 

So, can we do anything with 

I"' 4 tan 2 x X tan"" 2 xdx? 
Jo 

/tan 2 x dx is not something which we can immediately do, but we can 
rewrite this in a way which makes it much easier to handle by using the 
identity tan 2 x + 1 = sec 2 x. 
Now we have 

fJt/4 ry _~ 

I n = tan x X tan" x dx 

fJt/4 _ j 

= (sec x - 1) tan" x dx 
Jo 

so 

fJT/4 



In 



sec 2 x X tan" 2 x dx - 1, 



n-2- 



The neatest method of going on from here is not to use integration by 
parts at all, but to use the substitution t = tan x. This then gives us dtldx 
= sec 2 x so we can replace sec 2 x dx by dt. 

Also, if x = jtIA then t = 1 and if x = 0 then t = 0. This then gives us 

- 2 dt 



I"' 4 sec 2 x X tan"" 2 x dx = f 1 t"~ 
Jo Jo 



t 



n-l 1 1 



1 



SO 



1 



'n-2- 



We can now see why we had to say n > 2 at the beginning. We can't 
have n = 1. 



To find I 2 , 13 and I 4 , we shall first need to find both / 0 and I x because 
again we have a rule which is going in double jumps. 

7 0 = J o dx=[x\ Q 



jr/4 



jr/4 f 7r/4 Sin X 

tan x ax = ax 

J ° cos x 



= [-lncosx]^ =-ln(l/V2) = lnV2. 
Also, I 2 = 1 - I 0 = 1 - and 7 3 = 5 - I x 



and L = 



(1 - jt/4) = Ji/ 4 



In V2. 
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exercise 9.B.9 (1) (a) Find a rule which relates /„ = J"^ /2 cos" x dx to /„_ 2 . 

(b) Find l 0 and /, and then use these and the rule you have found to evaluate 
/ 2 , / 3 and / 4 . 

(c) Compare these answers with the corresponding answers for example (1). 
What do you notice? Draw sketches of /, and / 2 in each case to see how 
this happens. 

(2) The similarities in the answers to question (1) and Example (1) apply to many 
sin and cos integrals over particular equal intervals. Such integrals are very 
important for finding Fourier series so I am asking you to find some of them 
here, since you have now met all the necessary methods. (Fourier series are 
used to describe various wave functions in terms of sin and cos functions. You 
were able to get an experimental idea of how this might be possible in 
Section 5.c.(c).) 

Find the answers for the following four integrals if n and m are non-zero 
whole numbers. 

(a) J^. sin nx dx (b) § n cos nx dx (c) J^. sin 2 nx dx 

(d) J"^. sin nx cos mx dx if (i) m*n and (ii) m = n. 

Draw sketches showing the areas involved for the case when n = m = 1. 

(3) If '« = / cosh" x dx, show that nl n = cosh" -1 x sinh x + (n - 1) /„_ 2 . 

(4) If /„ = / sec" xdx, show that, forn > 2, (n - 1) /„ = sec" " 2 x tan x + (n-2) /„_ 2 . 
Use this result to find l 6 , and hence to show that J^ /4 sec 6 x dx = f|. 

9.B.(h) Using the f = tan (x/2) substitution 

It is surprisingly hard to tell how difficult it will be to find an integral just by looking at it, 
for example, we have seen already that /sin 2 x cos xdx is easier to do than J* sin 2 xdx. 

Similarly, I x = J(l/cos x) dx is considerably trickier to solve than I 2 = J(l/cos 2 x) dx. Can 
you recognise I 2 , even though it's slightly disguised? 



I 2 = [sec 2 xdx = tan x + C because — (tan x) = sec 2 x. 

dx 

But how could we find 7\? You just might know what it is, because we found in question (5) 
of Exercise 8.C.5 that 

d 

— (In (sec x + tan x)) = sec x. 

dx 

However, this is not the kind of answer which springs immediately to the memory. 
How can we show that this is the correct answer, starting from I x l 
In order to do this, we let t = tan(x/2). To show you how nicely this improbable 

substitution works, I will start with a simpler example and then come back to I x . My simpler 

example is 




dx 



First, we have to see how we can replace sin x, cos x and dx in terms of t. 
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We already know that 

2 tan A 

tan 2A = from Section 5.D.(d). 

1 - tan 2 A 

From this we can say 

2t 



tan x = 



l-t 2 

Also, we can say 

sin x = 2 sin (x/2) cos (x/2) = 2 tan (x/2) cos 2 (x/2) 

2 tan (x/2) 2 tan (x/2) 



so 



sec 2 (x/2) 1 + tan 2 (x/2) 
2t 



sin x = 



1 + t 2 

using the two identities sin 2A = 2 sin J cos J and 1 + tan 2 A = sec 2 ^4. 
sin x 

Since tan x = , we also have 



cos x 



l-t 2 



COS X 



1 + 1 2 

Finally, if t = tan (x/2) then 
dt 



dx 



= \ sec 2 (x/2) = \ (1 + t 2 ) 



again using the identity 1 + tan 2 A = sec 2 A. 

At this point, your courage may be beginning to fail, but we now have all the information 
we need to do any integral which needs a t substitution. I am putting it in a box for you. 



What you need when you use the substitution t = tan (x/2) 



2t 



sin x 



1 + f" 



l-t 2 2t 
cos x = , tan x = 



1 + t 2 



l-t 2 



and dx = 



2dt 

iT7 2 



Now we see how it works, using 
1 

dx. 



1= . 

1 + sin x + cos x 

Substituting into this, using the information from the box above, gives 

1 \ / 2 dt ' 

1 = 



1 



2 ^) + n -'' 



1 + t 2 



This looks far worse than what we started with, but multiplying the whole of the bottom of 
the first fraction by the (1 + t 2 ) tidies it up amazingly to give 

dt 



f dt 

= In 1 1 + t\ + C = In |1 + tan (x/2)| + C. 

J 1 + t 
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example (2) Now we return to 7\ = 



1 



dx. 



cos x 

Substituting in a similar way to the previous example gives us 

1 f 1 / 2 dt \ f 2 dt 

dx 



COS X 



1 - r 
TT7 



\+t 2 



l-t 2 



The integral has now turned into something we know how to do. 
We can use the method of partial fractions to give us 



h = 



dt 



dt 



l-t } 1 + t 
Putting back tan (x/2) for t gives us 
1 + tan (x/2) 



= - In 1 1 - ?| + In 1 1 + ?| + C = In 



1 + t 



C. 



I x = In 



1 - tan (x/2) 



C= In 



cos (x/2) + sin (x/2) 



cos (x/2) - sin (x/2) 



C. 



We are not quite there yet! We want to go back from the half-angles to 
an answer in terms of the whole angle, x. 
We do it like this: 

cos (x/2) + sin (x/2) (cos (x/2) + sin (x/2))(cos (x/2) + sin (x/2)) 

cos (x/2) - sin (x/2) (cos (x/2) - sin (x/2))(cos (x/2) + sin (x/2)) 

cos 2 (x/2) + 2 sin (x/2) (cos (x/2) + sin 2 (x/2) 



cos 2 (x/2) - sin 2 (x/2) 



1 + sin x 



= sec x + tan x 



COS X 



using the identities sin 2 A = 2 sin A cos A and cos 2A = cos 2 A - sin 2 A. 
This finally gives us 



f 1 

dx = / sec x dx = In | sec x + tan x | + C. 

J cos x 



Because this result is useful but rather hideous to prove, it is often given on formula 
sheets, and I have put it in a box for you for this reason. It is by far the most complicated 
example of this type that you are likely to meet because of the rearrangement at the end. The 
three in the exercise below are easier. 



EXERCISE 9.B.IO 



Try these for yourself now. 

(1) 



(2) 



W/2 dx 



(3) 



2 + 3 sin x + 2 cos x J o 1 + cos x 
Show that (1) is 0.305 to 3 d.p., (2) is 1 and (3) is In (^3) 



2JI/3 



dx 



2 + cos x + 2 sin x 
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9.C Solving some more differential equations 

Differential equations are equations which include terms like dyldx and d 2 y/dx 2 . We have 
already solved some simple examples in Exercise 9.A.1 and Exercise 9.A.2. All of these 
could be integrated straight away because they were given in terms of the variable with 
respect to which the differentiation had been done. (An example of this is v = 3t 2 + 2. Since 
v = dxldt, we have x = t 3 + 2t + C.) 

This section shows you how to deal with some examples which have mixed variables. 

9.C.(a) Solving equations where we can split up the variables 

In all the cases which we will solve here, it will be possible to split up the variables so that 
we finish up with two integrals, each entirely in terms of its own variable. Here is an 
example of such a differential equation. 

3 -x 

If dyldx = , 

y-2 

find the equation of the curve which has this gradient, given that y = 5 when x = 7. 

We split up the diffential equation so that all the ys are together and all the xs are together, 
and then write both sides as integrals. Doing this gives us 

2 2 

y L x z 
J (y - 2) dy = J (3 - x) dx so — - 2y = 3x - — + C. 

(We only need one constant of integration because this takes care of the combined constants 
on both sides.) 
Now we have 

x 2 + y 2 _ 4y _ 6x = 2C 

and putting y = 5 and x = 7 gives 2C = 12. 
We now have 

x 2 - 6x + y 2 - Ay = 12 or (x - 3) 2 + (y - 2) 2 = 25. 

This is the same circle which we used as an example for implicit differentiation in Section 
8.F.(a), so we know that we have found the right answer. In fact, we have just done the 
reverse process here to the one that we did there. 

Students sometimes find the splitting up in this kind of equation a little bit difficult, so 
we will get some more practice in this before moving on to some physical examples. 

example (i) tan x dyldx = 1/y sec 2 x and y = 0 when x = nlA 

Separating the variables and writing both sides of the equation as 
integrals, we get 



Jy dy 



sec 2 x 



dx. 



tan x 



Try doing the integration for each of these sides yourself. (The RHS 
is easier than it looks because dldx (tan x) = sec 2 x.) 



The LHS = j 2 /2, and the RHS = In |tan x| so we have \ y 2 = 
In |tanx| + C. 

Also, y = 0 when x = jz/4, so 0 = 0 + C giving C = 0, and we finish 
up with the solution y 2 = 2 In |tan x|. 
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example (2) (x + 1) (y + 1) = xy dyldx 

Separating the variables, and writing both sides of the equation as 



integrals, we get 

' x + 1 



dx 



y 



y 



1 



dy. 



Both these fractions are top-heavy, so we tidy up and rewrite as 



|(1 + 1/x) dx 



y 



1 - 1 



y 



1 



dy = 



1 



y 



1 



dy 



giving 

x + In |x| = y- In \y + l| + C so In \x(y + 1)| = j - x + C. 
We can also write this as 



x(y + 1) = e 



y-x + c 



c y-x 



e e 



If we now write the constant of e c as A, we get x(y + 1) = Ae y x . 
We could also have got this answer by letting C = \nA. 

example (3) If dyldx = (2y + 3) (y + 2) 

(a) find x in terms of y, if x = In (3/2) when y = 0, 

(b) from this, find j in terms of x. 

Separating the variables, and writing both sides of the equation as 
integrals, we get 



J dx 



dy 



1 



2y + 3 j + 2 



dx 



(2y + 3)(y + 2) 

writing the second integral in partial fractions. 
Integrating both sides gives 

x = In \2y + 3| - In \y + 2\ + C. 

This can be tidied up better if we let C = In A. Then we have 

' 2 j + 3 



x = In 



A 



y 



Putting x = ln(j) and _y = 0 gives ^ = 1, so we have 
2y + 3 



x = In 



and this is the answer to (a). 



To find the answer to (b), we start by doing the reverse process to 
taking logs on both sides of the answer above. This is sometimes called 
antilogging. Doing this gives us 

\ y + 2 

so ye x + 2e x = 2y + 3 
so y(e x - 2) = 3 - 2e x so y 
This is the answer to (b). 



3 -2e x 
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exercise 9.C.1 Try finding the general solutions to the following differential equations yourself 

now. Each of these must include a constant of integration. In each case, I then 
give you further information, as in Example (3) above, so that you can find the 
individual solution which fits that particular case. You can see the difference 
between these two kinds of solution in Section 9.A.(a). 

Remember that you must do your rearranging so that each integral only 
involves one variable. (It is possible to do this for each of the questions given 
here.) Remember, too, that the dx or dy or whatever must be on the top of any 
fraction inside its integral. (I have sometimes seen desperate attempts at 
integration when it has been on the bottom.) 

(1) y(dy/ dx) = cos x with y = 1 when x = jt/6 

(2) dx/dt = 3X with x = 1 when f = o 

(3) x 2 (dy/dx) = y 2 with y = \ when x = 1 

(4) y 2 (dy/dx) = x 2 with y = 2 when x = 1 

(5) 2xy(dy/dx) = y 2 + 1 with y = 0 when x = 1 

(6) (x - 5) (dy/dx) = 2/ with y = 2 when x = 4 

(7) cot y (dy/dx) = x with y = ji/2 when x = ^[2 

(8) rdO/dr = cos 2 6 with r = 2 when 6 = 0 

(9) e x + y (dy/dx) = x with y = o when x = 0 

(10) c/y/c/x = (1 + y 2 )(i + x 2 ) with y = o when x = 0 

9.C.(b) Putting flesh on the bones - some practical uses for differential equations 

There are many situations where we know the rate of change of a physical quantity but we 
need information about the quantity itself, as when we found velocity from acceleration, or 
distance travelled from a known but changing velocity. In this section, we will use the 
techniques of the earlier part of this chapter to work through some practical examples of 
differential equations together. 

Radioactive decay 

I mentioned this in Section 3.C.(f) as an example of a physical process which is described 
using e. We can now see how this happens. 

The decay takes place because unstable nuclei of a radioactive element (called 
radionuclides), emit particles so that they are then converted into nuclei of another element; 
so that, for example, a certain type of uranium gets changed into lead. These particles don't 
all get emitted simultaneously; the physical law which governs this process says that, if there 
are TV radionuclides present after a time t, then the rate of decay at this time is directly 
proportional to N. 

Try writing down an equation for yourself which says this. 



We can say that the rate of change of N with t is dNIdt. 

It is directly proportional to N, so it is equal to kN where k is a constant. (See Section 
3.A.(a) if necessary.) 

The rate of change is negative because N is decreasing so, if we take k to be positive, we 
have dNIdt = -k N. 

Physicists usually use, X, the Greek letter 1, called lambda, instead of k for this constant, 
so I shall do this too. 1 is called the disintegration constant, and depends on the actual 
radioactive substance concerned. 
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Suppose that the number of radionuclides present when t = 0 is N 0 . We know that both 
N and N 0 are positive because they represent physical quantities. We now have 



dNIdt = -XN and N = N 0 when t = 0. 



Try solving this equation yourself, giving both t in terms of N and N in terms of t. 



We have 

fdN 

dNIdt = -XN so — = - Xdt so In N = - At + C. 
J N ] 

Since N = N 0 when t = 0, we know that C = In N 0 . 

You can see here that, if we had left out the constant of integration, any later working 
would all be wrong. Also, because both N and N 0 are positive, we don't have to bother 
writing In \N\ and In \N 0 \. 

Now we have 



/ N\ 

In N = - Xt + In N 0 so In — = - Xt 



using the second rule for logs from Section 3.C.(d). 

Doing the reverse process to taking logs on this equation, we can say 



— = e At so N = N 0 e 
N 0 



It is often very useful to know the time taken for half the radionuclides to have decayed. 
This will depend on the particular value of X for the substance concerned. We have 

In (AWV 0 ) = - Xt. 

If N = \ No we have In (j) = - Xt, but 

In (5) = In (1) - In (2) = - In (2) 

so 

- In 2 = - Xt so t = (In 2)/X. 

This special value of t is called r, the Greek letter t, pronounced to rhyme with Ow. 
It is called the half-life of the substance. We now know that 



the half-life of the substance is given by r = (In 2)1 X. 
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Half-lives vary enormously. For example, K, a radioactive form of potassium, has a half- 
life of 1.25 X 10 9 years. It decays into a form of argon, and the age of rocks can be 
calculated from the ratio of the amounts of these two substances in them. On the other hand, 
there is a radioactive form of iodine, 128 I, which has a half-life of 25 minutes. It is used 
medically as a tracer to measure the rate at which iodine is absorbed by the thyroid 
gland. 

example (1) A calculation based on radioactive decay. There is a radioactive form of 
carbon which is produced at a constant rate in the upper atmosphere, 
and which then mixes with the ordinary carbon which is present in the 
air as carbon dioxide. All living plants and animals need air to keep 
them alive, so they are taking in a certain amount of this radioactive 
carbon all the while. Once they die, they no longer take in fresh 
supplies, and the radioactive carbon already present in them will decay. 
It has a half-life of 5730 years. 

A human skeleton is dug up in a field. If the bones of this skeleton 
have 90% of the radioactive carbon which would be present in living 
bones, how long is it since the owner of the skeleton died? 

Have a go at working this out for yourself. 



We have 

In 2 

r = 5730 = (In 2)IX so X = and also N = 90% N 0 = jzN 0 . 

5730 

Using In (N/N 0 ) = - Xt gives 
In 2 

In (0.9) = t so t = 870 years to 2 s.f. 

5730 

Two chemical examples 

example (l) Using a first-order rate law. If the rate of a chemical reaction can be 
described by the equation - dcldt = kc where k is a constant and c is 
the concentration of a solution in moles per litre, then this is called a 
first-order rate equation. (The rate of change of the concentration 
depends on the concentration at that time.) 

If c 0 is the concentration when t = 0, show for yourself that 
In (c 0 /c) = kt. 



This uses exactly the same maths as the previous example although it 
is describing a different physical situation. We have 



f dc 

I — = - Jkdt so In c = - kt + C. 



Now c = c 0 when t = 0, so C = In c 0 , and 

In (c 0 /c) = kt. (1) 
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If you have experimental pairs of results for c and t, you can see from 
this equation that plotting In (c 0 /c) against t will give you a straight line 
through the origin with a gradient of k. (I have described how this 
works in more detail in Section 3.D.) 

If you are working using logs to base 10, you can use the rule from 
Section 8.B.(d) that In a = (In 10) (log a). 

This gives us 

In ( — ] = (In 10) log ( — J = 2.303 log 



\ c J \ c I 

(Remember that 'log' means log 10 .) 

This means that we can write equation (1) above as 

/ c 0 \ -kt 

2.303 log — = kt or log c = 1- log c 0 

\ c I 2.303 

using the second law of logs. 

From this, we can see that plotting log c against t will also give a 
straight line, with a gradient of - &/2.303, and an intercept on the 
vertical axis of log c 0 , which means that we can find k and c 0 for that 
particular reaction. 

example (2) Plotting results which are connected by Arrhenius' equation. This is a 
mathematically similar equation which you might meet in chemistry or 
theory of materials courses. We do the same kind of thing with it to get 
a straight line graph. 

Arrhenius' equation says that 

k = Ae~ EJRT or k = A exp \ —^- 

\ RT 

using the alternative way of writing powers of e. 

A is a constant called the frequency factor or Arrhenius constant, E a 
is the activation energy, R is the gas constant, and T is the absolute 
temperature. (The meanings of these letters won't affect the maths of 
how we get the straight line, but of course they will be very important 
to you if you are a chemist or a materials engineer.) 

We have 

/ E a \ E a 
k/A = exp - — so In (k/A) = - — 
\ RT) RT 

or In k = In A - — . 

RT 

Comparing this with y = mx + c, as I showed you how to do in Section 
3.D, you will see that, if you plot In k against l/T, you will get a 
straight line which has a gradient of - EJR, and which cuts the vertical 
axis at In A. From these measurements, you can find both .4 and E a . 

As in the last example, you can work with logs to base 10 if you 
prefer. Your working equation then becomes 

E a 

log k = log A . 

2.303i?r 
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Newton's Law of Cooling 

This law states that the rate of cooling of a previously heated body is proportional to the 
difference between its temperature T after time t, and the temperature T c of the surrounding 
environment. Try answering the following questions about this law. 

(1) When will the heated body cool fastest? 



(1) It cools fastest when it is first brought into the cooler environment. 'Come and eat 
it while it's hot! ' makes good physical sense. 

(2) Write down an equation for the rate of change of temperature with time. 



(2) dTldt = -k(T - T c ) where k is a positive constant. 

(3) If the temperature of the body is T h when t = 0, solve this equation. (I've used h 
for 'hot' and c for 'cold' here.) 



(3) Separating the variables, we have 
dT 



J ~ ~ =-k Jdt so In | T- T c \ = - kt + C. 



Since T= T h when t = 0, we have C = In \T h - T c \. Also, while the object is cooling, 
we have T h > T> T c , so we can say 

\n{T-T c ) = -kt + \n{T h -T c ). (1) 

Using the second law of logs on equation (1) gives us 

In ( T — ) = -kt. (2) 



Antilogging, we can write this as 

T - T r . 



— „-kt 



c . (3) 

Th ~ T c 

Which one of these equations we find easiest to work with will depend on what 
we are looking for. 

(4) We'll assume that Newton's Law of Cooling will apply to a cup of coffee. (This is 
a rather complicated physical object, so experimental verification of this law might 
be sensible in this case. The maths will only tell you the consequences of a physical 
law; it won't tell you how well it fits the actual physical circumstances.) 

Suppose that the temperature of the coffee is 90 °C when it is brought into a 
room at 25 °C, and that after ten minutes it cools down to 60 °C. 
Working in minutes and °C, find the value of A: to 2 s.f. 



(4) Using equation (2), we have 



35' 

In | — | = - 10& so k = 0.062 to 2 s.f. 
65 
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(5) (a) Find its temperature after 5 minutes to the nearest degree. 

(b) Find its temperature after an hour. What do you notice? What happens after 
two hours? 

(c) How long will it take to cool down to 37 °C? (This is about blood heat.) 



(5) (a) When t = 5, we have T- 25 = 65e~ 0 - 31 from equation (3), so T= 73. After 5 
minutes, it has cooled to about 73 °C. 

(b) When t = 60, we have T - 25 = 65e" 3 - 72 so T = 27 to 2 s.f. 

The coffee is now very close to the temperature of the room. After two 
hours, it will be equal to the room temperature to 2 s.f. We can see 
mathematically that it gets closer and closer to 25 because e~° 062t gets closer 
and closer to zero as the value of t gets larger. 

12 

(c) Using equation (2), when T= 37 we have In (gj) = -0.062? so t = 27. It takes 
about 27 minutes to cool to blood heat. 

A leaking cone 

An upside-down cone of height H and radius R is filled with water. I show a drawing of it 
in Figure 9.C1. 




Figure 9.C.1 

Suddenly, the tip of the cone springs a leak, and the water begins to run out. The rate at 
which the volume of water in the cone decreases is proportional to its depth. 

We'll now work together to find out the rule which describes how the depth of the water 
in the cone changes. 

If the volume of water left in the cone is V after a time of t, and its depth is h, write down 
an equation for dvldt. 



We will have dvldt = -kh where A: is a positive constant. 

At first sight, this equation looks as if it will work in the same kind of way as our 
previous examples so that it will give us a log. This is not so, however, because here we are 
talking about a rate of change of volume given in terms of depth. We have an equation with 
three variables, V, h and t. We also know that V = j Jtr h. 
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If we are using the techniques described in this book, we can only move forwards now 
if we can find a way to describe V in terms of just the one variable h. (There was a very 
similar situation in Example (2) of Section 8.E.(d).) 

Use Figure 9. C. 1(b) above to find r in terms of R, H and h. 

From this, what is V in terms of R, H and hi 

Using what you now have, what is dVldhl 



We have rlh = RIH, because we have similar triangles, so r = RhIH 
Therefore, 

R\ 2 . dV (RV 

— h so — = Jt \ — | 
M dh \H 



V= \ Jtr 2 h = \ jt ( — | h 3 so — = Ji\ — \h 2 . 



We can now use the Chain Rule to say that dV/dt = (dV/dh) (dh/dt). Doing this, what 
is dh/dt? 



We have 



dV (RV dh dh 1 (kH 2 

= -kh = 71 I — h 2 — so 



dt \H) dt dt h \7cR 

The quantity in the bracket is itself a constant because it is made up of constant 
quantities. 

Now separate the variables so that you can integrate. 

See what you get when you do this, remembering that h = H when t = 0. 



You should have 



kH 2 \ f , / kH 2 



hdh = -\ I I dt so \h 2 = - I \t + C 

jtR 2 /} \7iR 2 ) 

and, since h = H when t = 0, we have C = \ H 2 . This gives us 

kH 2 \ , 

— )t = \(H 2 -h 2 ). (1) 

71 R 2 

If the time taken for the cone to empty completely is T, show that k = . 

IT 

What does equation (1) now become if you substitute this instead of k? 



kH 2 \ , TlR 2 
t = T when h = 0 so ) T = , H so k= 



„nR 2 I IT 
Replacing k by this in equation (1) gives 
'jtR 2 \ / H 2 \ H 2 t 



t = = H H ~ h 2 ). (2) 

2T)\jxR z ) IT 
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Now answer the following two questions, using equation (2). 

(a) How long does it take for the water to get to a depth of \ W. 

(b) How deep is the water after half the time which it takes for all the water to leak out 
of the cone? 



1 ^13t 3 

(a) When h = 5 H, we have — — = 5 (4 H 2 ) so t = ^T. 

H 2 H 

(b) When r = \t, we have — = \ (H 2 - h 2 ) so h 2 = \H 2 so A = -=. 

4 ^2 

exercise 9.C.2 (1) Here are two examples of second-order rate equations from chemistry. 

(a) - dc/dt = kc 2 (b) dx/dt = k(a-x)(b-x) 

(A second-order rate equation is one in which the rate of the reaction is 
proportional to the square of the concentration of one of the reagents, or to 
the product of the concentrations of two species of reagents.) 

For (a), show that, if c stands for the varying concentration of the reagent, 
and c = c Q when t = o, then l/c - i/c 0 = kt. 

For (b), show that, if a-x and b-x are the concentrations of reagents A and B 
after timer, with a and b being the concentrations of A and B when f = o, then 

1 , (b(a-x)\ 

In = kt. 

a-b \a(b-x)J 

(2) A particle moves in a straight line so that its distance from a fixed point after 
a time f is s. Its velocity after this time is v. If s = o and v = u when r = o, 
and the particle has a constant acceleration a, show that the motion of the 
particle can be described by the following four equations. 

(a) v = u + at (b) s = ut + \ at 2 

( u + v\ 

(c) s = I J f (d) v 2 = u 2 + 20s. 

(3) A particle is falling under gravity but with a drag force acting on it resisting its 
downwards motion. This drag force is proportional to its velocity, so that its 
acceleration at time f is given by g - kv. 

If the particle starts from rest when f = o, show that the effect of the drag 
force is to stop it falling faster and faster, and that its velocity gets closer and 
closer to the constant value of g/k. This is called its terminal velocity. 

(4) A second particle which is falling from rest under gravity also has a drag force 
acting on it, but this time the force is proportional to the square of its velocity. If 
its velocity is v after a time f, and its acceleration is given by dv/dt = g- kv 2 , 
show that x, the distance travelled after time t, is given by the equation 

1 

x = — In 



2k \g-kv 2 
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To show this, you will need to use the Chain Rule in the form 

dv dv dx dv dv 

— = = v — so v — = g - kv 2 . 

dt dx dt dx dx 

Now rearrange the answer you have already got which gives x in terms of v 
so that it gives v in terms of x instead. Then use this rearranged form to show 
that, as x increases, v will become closer and closer to the constant value of 



9.C.(c) A forwards look at some other kinds of differential equation, including 
ones which describe SHM 

We have been able to solve all the differential equations which we have looked at so far in 
this section by separating the variables into two integrals, one for each variable. This is the 
only kind of differential equation which I show you how to do in detail in this book, but we 
can look at a few special cases of other types here. This is because we have already met them 
working the other way round, and also because they are so important physically that I think 
it will help you to have met some examples informally when you come to study their 
solutions in your courses. 

We'll go back first to Section 8.A.(e) where we showed that if x = cos t then d 2 xldt 2 = -x. 
As we said there, this is an example of a differential equation which describes SHM. This 
means that it describes the motion of a particle on a straight line whose acceleration is 
always towards a fixed point, and proportional to the distance of the particle from this point. 
SHM is of enormous importance to physicists and engineers, because it can be used to 
describe the effect of combinations of small vibrations. 

In Section 8.A.(e), we couldn't yet differentiate more complicated sin and cos 
functions. 

Now that this is no problem, find dxldt and d 2 xldt 2 for the following functions and write 
down an equation relating x and d 2 xldt 2 in each case. 

(a) x = 3 cos t (b) x = cos 2t (c) x = cos(f + jt/6) 

(d) x = 2 cos (t - 7tl€) (e) x = 5 sin (3t + Jt/6) (f) x = A cos (cot + e) 



You should find that each of the given equations for x is a solution of the SHM 
differential equation of d 2 xldt 2 = - k 2 x where k is some positive number. 
Your working should go as follows: 

(a) dxldt = -3 sin t and d 2 xldt 2 = -3 cos t so d 2 xldt 2 = -x. 

(b) dxldt = -2 sin 2t and d 2 xldt 2 = -4 cos 2t so d 2 xldt 2 = -Ax. 

(c) dxldt = - sin (t + jt/6) and d 2 xldt 2 = - cos (t + 7il6) so d 2 xldt 2 = -x. 

(d) dxldt = -2 sin(f - 71/ 6) and d 2 xldt 2 = -2 cos (t - tt/6) so d 2 xldt 2 = -x. 

(e) dxldt = 15 cos (3? + jt/6) and d 2 xldt 2 = -45 sin(3^ + jt/6) so d 2 xldt 2 = -9x. 

(f) dxldt = -Acq sin (cot + e) and d 2 x/dt 2 = - Acq 2 cos (cot + e) so d 2 xldt 2 = -co 2 x. 

These examples all come from Section 5.C.(b). If you go back there, you will find 
drawings of (a), (b) and (c), and you can see the distance x marked on them. This will help 
you to picture how this distance will change with time, and what it is when t = 0. 
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Examples (d) and (e) are questions (7) and (8) from Exercise 5.C.2. Again, you can use 
your drawings to help you to picture what is happening physically here. 

Example (f) above gives you a general form of the SHM equation, using the letters which 
you will commonly find in scientific and engineering applications. A and oo are explained in 
Sections 5.C.(a) and (b), and e, which gives the starting position when t = 0, is the Greek 
letter e which is called epsilon. You can see from Sections 5.D.(f) and (g) exactly how the 
e works, and also that A cos (cot + e) can be written in the form of two separate sin and cos 
functions. 

It would also be possible to use the form A sin (cot + e) if you wish. As an example of this, 
we showed in Section 5.D.(g) that 3 sin 2t + cos 2t can also be written as ylO sin (2? + a), 
with a = 0.32 radians to 2 d.p. 

You can see a sketch of the graph of x = -JlO sin (2? + a) in Figure 5.D.9(b), and the 
sketch showing the motion of P on its circle giving the vertical distance x in Figure 5.D.9(a). 
x could represent the displacement from O of a particle moving in SHM as t changes. In this 
example, A = yj 10, oo = 2 and e = a = 0.32. 

If there are other forces acting as well as a force which on its own would produce SHM, 
more complicated equations are needed to describe the motion. Questions (9) and (10) in 
Exercise 8.C.4 show you examples of such equations, and they also give you some idea of 
how we could set about solving them by using powers of e found from the roots of quadratic 
equations. 

Have another look at these two questions now and then see if you can solve the 
differential equation d 2 xldt 2 - 5dxldt + 6 = 0 by putting x = e kt . 



Finding d 2 x/dt 2 and dxldt and substituting these into the equation, you should find that 
e fa (k 2 - 5k + 6) = 0 so k 2 - 5k + 6 = 0, since e k> is never equal to zero. 
This means that k = 2 or k = 3. 

Making use of both these possibilities, and writing the solution in the most general way, 
gives x = Ae 2t + Be 3t where A and B are two constants of integration. 

Now, if we also know in this particular case that x = 0 when t = 0, and that dxldt = 2 when 
t = 0, see if you can find out what the values of A and B are. 



Putting x = 0 and t = 0 in x = Ae 2t + Be 3t gives 0 = A + B. 

Now, dxldt = 2Ae 2t + 3Be 3t , so if dxldt = 2 when t = 0 we also know that 2 = 2 A + 35. 

Solving these two equations, we find that B = 2 and A = -2. Therefore the solution of the 
differential equation is x = 2e 3t - 2e 2t . 

It seems a little curious that this answer appears to come out so differently to the solutions 
of the SHM equations from the beginning of this section, since these also involved d 2 xldt 2 
and x. You would expect that the method of putting x = e kt which we have just used should 
also work for these SHM equations, and yet they all had solutions for x which involved a sin 
or a cos instead of powers of e. What actually happens if you put x = e kt in the equation 
d 2 xldt 2 = -9x, for example? 

We shall find out in Section 10.C.(b) that sin, cos and e are not as unrelated as they might 
seem. I shall show you how to solve just one differential equation describing SHM in 
Section 10.C.(i) because its solution connects together so many of the ideas in this book. 
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Physically, differential equations of this type are very important. Two examples of 
situations which can be described mathematically by such equations are the possible fate of 
a suspension bridge if a platoon of soldiers marches across it in step, and the combined 
effect of the suspension of a car, its dampers and the bumps in the road on its vertical motion 
as it travels along. 
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Complex numbers 



In this chapter we powerfully extend the possibilities of what we can do with our 
number system. This extension leads us to simpler ways of finding mathematical 
rules. It also has many very important physical applications. 
The chapter is divided into the following sections. 

10.A A new sort of number 

(a) Finding the missing roots, (b) Finding roots for all quadratic equations, 
(c) Modulus and argument (or mod and arg for short) 

10.B Doing arithmetic with complex numbers 

(a) Addition and subtraction, (b) Multiplication of complex numbers, 

(c) Dividing complex numbers in mod/arg form, (d) What are complex conjugates? 
(e) Using complex conjugates to simplify fractions 

10.C How e connects with complex numbers 

(a) Two for the price of one - equating real and imaginary parts, 

(b) How does e get involved? (c) What is the geometrical meaning of z = e l0 2 

(d) What is e~'° and what does it do geometrically? 

(e) A summary of the sin/cos and sinh/cosh links, (f) De Moivre's Theorem, 

(g) Another example: writing cos 5$ in terms of cos 0, 

(h) More examples of writing trig functions in different forms, 

(i) Solving a differential equation which describes SHM, 

0) A first look at how we can use complex numbers to describe electric circuits 

10.D Using complex numbers to solve more equations 

(a) Finding the n roots of z" = a + bj, 

(b) Solving quadratic equations with complex coefficients, 

(c) Solving cubic and quartic equations with complex roots 

10.E Finding where z can be if it must fit particular rules 

(a) Some simple examples of paths or regions where z must lie, 

(b) What do we do if z has been shifted? (c) Using algebra to find where z can be, 

(d) Another example involving a relationship between w and z 



10.A A new sort of number 

lO.A.(a) Finding the missing roots 

So far, we have been able to show all of the answers or roots of the equations which we have 
been able to solve as points on the horizontal axis of a graph. If we write the equation as 
f(x) = 0, then its solutions are where the curve of y =f(x) cuts the x-axis, so that y = 0. 
These solutions or roots have included different kinds of numbers. For example: 

(a) 2x 2 - x - 6 = 0 has the two roots x = 2 or x = -§ since 
2x 2 - x - 6 = (x - 2) (2x + 3). 

(b) x 2 - 4x - 1 = 0 has the two roots x = 2 ± \[5, using the quadratic formula from 
Section 2.D.(d). 

(c) 2x 3 - x 2 - 5x - 2 = 0 has the three roots x = - j, x = 2 and x = - 1 
since 2x 3 - x 2 - 5x - 2 = (2x + 1) (x - 2) (x + 1). 

(d) 4x 4 - 5x 2 + 1 = 0 has the four roots of x = 1, x = -l, x = j and x = -5 
since 4x 4 - 5x 2 + 1 = (x - l)(x + l)(2x - 1) (2x + 1). 
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In order to have all these solutions, we have had to extend the system of counting 
numbers 1, 2, 3, 4, ... to include negative numbers, fractions, and numbers like ^5 which 
can't be written as fractions. I described all these different kinds of numbers in Section I.E. 
Together, they make up what are known as the real numbers. We can think of the horizontal 
axis of a graph as a number line which contains all the solutions to equations like (a), (b), 
(c) and (d) above. 

But we have also found that sometimes equations which look as though they ought to 
have solutions, such as x 2 - 2x + 2 = 0, have no solution given by any number which we can 
find on the x-axis. It appears that sometimes the number of solutions tallies with the highest 
power of x, as in the four examples of (a), (b), (c) and (d) above, but sometimes this doesn't 
seem to work. This peculiar situation is shown particularly clearly by what seem at first sight 
to be the simplest possible equations of this kind. 

We know that the equation x 2 = 1 has the two solutions or roots of x = +1 or x = — 1. 

However, the equation x 3 = 1, unlike (c) above with its three solutions, only has the one 
solution of x = 1 . 

The equation x 4 = 1 again has the two solutions of x = +1 or x = -1. 
With x 5 = 1, we are back to just the one solution of x = 1. 

As we take higher powers of x, this pattern will continue, with just one solution of x = 
+1 for odd powers, and the two solutions of x = +1 or x = -1 for even powers. 

This is somehow not very satisfying; would it not feel more correct if x 2 = 1 had two 
roots, x 3 = 1 had three roots, x 4 = 1 had four roots and so on? But where would they be? 
How could we widen our number system so that we would have these extra roots or 
solutions? 

Suppose we take the horizontal axis out of the graph paper, and lay it out separately as 
a number line which contains all the roots of the equations which we can so far solve, 
including equations (a), (b), (c) and (d) above. This number line, which I have drawn in 
Figure 1 0.A.1, shows all the real numbers. (The arrows are there to show that this line can 
be infinitely extended in either direction.) 



Figure 10.A.1 

Now imagine that we are looking down on this number line and seeing all these roots. We 
are also seeing the various roots of x = +1 and x = -1 for the equations x n = 1. (We see both 
+1 and -1 if n is even, and just +1 if n is odd.) 

If we take the particular case of x 4 = 1, we have x = +1 and x = -1 as two of its roots. 
If we want this equation to have four roots, where could we think of the other two roots as 
being? 



Suppose we think that up to now we have just seen the possible answers through a slit 
which allows us to see the number line of Figure 1 O.A.I. Sometimes this has meant that we 
could see all the possible solutions to an equation and sometimes it has meant that there are 
solutions which are somehow off to the side so that they are hidden from us. If so, it seems 
reasonable that the four roots of x 4 = 1 should be symmetrically placed. This would give us 
the four roots (a) (b) (c) and (d) which I have shown in Figure 10.A.2(a). 
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It seems a good idea that the two roots on the vertical axis should also be placed one unit 
away from O. But what are they? 

They would have to be two different numbers, each of which multiplied by itself four 
times would give the answer of +1. 

We know that (-1) 2 = +1, so if we can somehow think of the vertical axis as showing 
units of yf—l, as the horizontal axis shows units of v'+l, we shall be able to have the four 
roots which we would like. 

Each root is one unit away from the origin, so root (b) would be y^l and root (d) would 
be-V^T. 

If we now let ourselves say that yf—l X ^PT = -1, the two extra roots will work in the 
following way 

For (b), we shall have 

(V^I) 4 = (V=T) x (V=T) x (V=T) x (V=T) = (-1) x (-1) = +i. 

For (d), we shall have 

(-V^T) 4 = (-V=T) x (-V=T) x (-V=T) x (-V=T) = +1 

like (b), since each pair of minuses multiplied gives a plus. 

To emphasise that we have invented a new number here, which needs a separate direction 
of its own to show it, we shall call yf—l by the letter j. 



We are defining a new number j such that j 2 = -1 so j = 



We can now write the four roots of x 4 = 1 as x = +1, x = -1, x =j and x = -j with each 
of these roots being one unit away from the origin. I show them in Figure 10.A.2(b). 

The horizontal axis shows the real numbers (just like any ordinary x-axis) so it is called 
the real axis and labelled Re instead of x. 

The numbers shown on the vertical axis, which are measured in units of j, are called 
imaginary numbers. (This curious name is for historical reasons.) We therefore call the 
vertical axis the imaginary axis and label it Im instead of y. 

Figure 10.A.2(b) is an example of what is called an Argand diagram. It is named 
after the Swiss mathematician who first thought of showing complex numbers in this 
way. 



Complex numbers 



# Mathematicians often use i rather than j for yf—l. However, physicists and 
engineers usually use j, because imaginary numbers have important physical 
applications in the study of electric circuits where the letter i is often used 
for current. 



Can you draw a sketch showing where you think the eight roots of x 8 = 1 
might be? We shall come back to this at the end of Section lO.B.(b). 



10.A.(b) Finding roots for alt quadratic equations 

Next we shall find out whether having this new number j will make it possible for us to find 
solutions for all quadratic equations. 

Suppose we take as an example the equation x 2 - 8x + 25 = 0. 

It is usual to use z instead of x if we are extending the possibilities for roots by using these 
new numbers, so I shall rewrite this equation as z 2 - 8z + 25 = 0. 

Now we see what happens if we use the quadratic formula. Using z, this formula says 

-b ± yjb 2 - 4ac 

If az 2 + bz + c = 0 then z = . (Section 2.D.(d)) 

2a 

Here, we get 

+8 ± V64 - 100 +8 ± \P36 +8 ± 6 V 3 ! 
Z 2 2 2 

so 

z = +4 ± 3 v-1 giving Zj = 4 + 3j and z 2 = 4 - 3j 

using our new number j for yf—l, and calling the roots z l and z 2 . 

(Try checking for yourself whether you think each of these two roots do fit the equation 
by substituting each of them back into the equation in turn and seeing what happens.) 

Notice that the two roots which we now have for this equation are each made up of two 
parts, the +4 which is a real number, and the 3j which is then either added or subtracted. We 
can show these two roots on the same sort of diagram that we used in Figure 10.A.2(b), with 
the real parts lying along the horizontal or real axis, and the imaginary parts lying along 
the vertical or imaginary axis. I have drawn this particular pair of roots in Figure 10.A.3. 



-*.lm 




Figure 10.A.3 
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Notice that they have the property that they are symmetrically placed either side of the real 
axis. We shall look at the special properties of pairs like this in Section lO.B.(d). 

I shall now use the number 4 - 3/ shown in Figure 10.A.3 to give you some 
definitions. 



A number like 4 - 3j is called a complex number. 

4 is called its real part and -3 is called its imaginary part. 

The number 4 gives the measurement along the real axis. 

The number -3 gives the measurement along the imaginary axis. 

If z = 4 - 3/ then we say Re (z) = 4 and Im (z) = -3. 



(Notice that the imaginary part tells us how many units of j we have; it does not actually 
include the j.) 

exercise io.a.1 Solve the following equations writing your answers in the same way that we used 

in the example above. (This is called writing them in the form a ± bj.) 

Show each pair of roots on a separate Argand diagram. Save these answers 
and sketches as you will need them for the next exercise. 

(If you feel shaky about V36 being the same as 6, or ^/i2 being the same as 
2^3, you should read through Section i.F.(c) and then do Exercise 1.F.3 before 
continuing.) 

(1) z 2 - 2z + 2 = o (2) z 2 - 4Z + 13 = o (3) z 2 + 4Z + 5 = 0 (4) z 2 + 2Z + 6 = o 

10.A.(c) Modulus and argument (or mod and arg for short) 

In the last section, we found that the two roots of the equation z 2 - 8z + 25 = 0 are z x = 4 + 3/ 
and z 2 = 4 - 3/ and I showed them on the Argand diagram of Figure 10.A.3. In Figure 10.A.4, 
I show an alternative way of describing the positions of z x and z 2 on an Argand diagram. 
Instead of using the pairs of coordinates given by their real and imaginary parts, we can use 
two other measurements. 

The first of these is their lengths which I have labelled \z x | and |z 2 1 on the diagram. 

The length of a complex number is called its modulus or mod, and it is written as 
mod z or |z|. 




Figure 10.A.4 
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From Figure 10.A.4 we see that |zj| = ^A 2 + 3 2 = 5 by Pythagoras' Theorem. 
Also \z 2 \ = \z x \, since \z 2 \ = ^4 2 + (-3) 2 = 5. 

For any complex number z = x + yj we have the following definition. 



The modulus or length of the complex number z = x + yj is given by |z| = yx 2 + j 2 . 
Because |z| is a length, it is always positive. 



If we want to talk about more than one modulus, we call them moduli and not moduluses. 
(This is because 'modulus' is a Latin word.) 

To draw a complex number z on an Argand diagram it will not be enough just to know 
its length. We can see this in the example above where both z l and z 2 have the same length. 
In order to describe them fully, we also need to know the direction in which we should draw 
them. This direction can be described by using the angles turned through from the positive 
real axis to get to each of them. I have called these two angles 0j and 9 2 on Figure 10.A.4. 
In this particular example, these two angles are equal in size, but are turning in opposite 
directions from the positive real axis. 



The angle turned through from the positive real axis to give a complex number, z, 
is called its argument. 
It is written as arg z. 

An anticlockwise turn gives a positive angle. 
A clockwise turn gives a negative angle. 



Here, 

arg z 1 =d l = tan -1 (|) = 0.644 radians to 3 d.p. 
and arg z 2 = 9 2 = - tan -1 (|) = -0.644 radians to 3 d.p. 

Writing this pair of angles in this way shows their symmetry very nicely. It is true that 
we could also get to the same position of z 2 by turning the other way. This would give an 
angle 9 2 = 2n - tan -1 (5) = 5.640 radians to 3 d.p. but this hides the symmetry of 9 r and 9 2 , 
coming from the symmetrical pair of 4 ± 3j. 

For this reason we give this further definition. 



The principal value of the argument of a complex number z lies between - jt 
and jt, so that - jt < 9 < jt. 



Therefore, the values I have given above for 9 Y and 9 2 are the principal values for the 
arguments of z x and z 2 . 

^^^^ It is rather easy to make mistakes when finding the arguments of complex 
^B-^V numbers if you use a calculator to find tan 1 9. The following example 
shows why this is. 
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We will find the mod and arg of the pair of solutions to the equation z 2 + 2z + 4 = 0. 
Using the formula gives 

-2 ± f-\2 - 2 ± 2 J-3 

z = = = - 1 ± 

2 2 

I show the pair of roots z x = - 1 + yfjj and z 2 = - 1 - y[3j in Figure 10.A.5. 




Figure 10.A.5 

We have 

kl = V(-1) 2 + (V3) 2 = V4 = 2 and |z 2 | = V(-l) 2 + H/3) 2 = 2. 
We can also see from the diagram that 

arg zj = 6i = 71 - tan -1 3) = Jt - tt/3 = 2jr/3, 

and 

arg z 2 = 9 2 = -7i + tan -1 (^3) = -tt + 7t/3 = -2jt/3. 

But if you had used tan -1 (^37-1) = tan -1 (-y\3) to find 9\ you would have got - jt/3 
which is an angle in the fourth quadrant, and so not the right place at all. To remind you 
where each quadrant is, I have labelled them in the diagram above as (1), (2), (3) and 
(4). 

Similarly, if you had said 9 2 = tan -1 (-\3/- 1) = tan -1 yfj, you would have got the angle 
of 7t/3 in the first quadrant which is again the wrong place. The problem is that the function 
tan -1 is only defined for the range from -jt/2 to + jt/2. (We saw why this is in Section 
5.A.(i).) 

In this particular example, you could of course find 0 2 from its symmetry with 9 X . 

You should always draw a little sketch of what is happening when finding arguments so 
that you don 't get them in the wrong place. 

(To help you to check your answers, remember that there are roughly 3 radians in a half- 
turn of 7t, and one radian is roughly 60°.) 

A shorthand version which is often used to write a complex number z in mod/arg form, 
is to write z as [r, 0] where r = \z\ and 9 = arg z. In the example above, we would have Zj 
= [2, 2jt/3] and z 2 = [2, - 2jt/3]. 



V3/ 
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Another shorthand version, which is often used by engineers, is to write rl.9 instead of 
[r, 0]. 




r and 9 can be used instead of x and y in any graph. If the graph is not an 
Argand diagram, then 9 is measured between 0 and 2jt. r and 9 are called 
polar coordinates and x and y are called Cartesian coordinates. This name 
is after the French mathematician and philosopher, Rene Descartes, who is 
also famous for having said 'Cogito ergo sum' or 'I think, therefore I am'. 



What is the connection between z = x + yj and z = [r, #]? 
I show this in Figure 10.A.6. 



A tm 




Figure 10.A.6 

From this diagram, we see the following pair of results. 



x = |z| cos 9 = r cos 9 and y = |z| sin 9 = r sin 9. 
So z = x + yj = r (cos 9 +j sin 9). 



Looking back, you will see that these relationships are true for all the diagrams wherever 
z is. The size of 9 automatically takes care of the signs of x and y. We are making use here 
of the definitions for the sin and the cos of angles greater than nil. These came from the turn 
of a unit length about the origin (in Section 5A.(c)), so they are very much related to what 
we are doing now. 

exercise io.a.2 In each of the following questions, give the arguments in radians correct to 2 d.p. 

unless they are an exact fraction of n. 

(1) Find the modulus and argument of each of the following complex numbers. 

(a) 3 (b) -2 (c) 2/ (d) -5/ (e) 5 + 12/ 

(f) -5 - 12/ (g) 7 - 24/ (h) -7 + 24/ (i) - v 3 + / 0) - 1 - / 

(2) Figure io.A.2(b) in Section io.A.(a) shows the four roots of z 4 = 1. 
What is the modulus and argument of each of these? 

(3) Find the modulus and argument for each of the pairs of roots of the quadratic 
equations which you solved in Exercise 10.A.1. Use your sketches of these 
roots to make sure that your arguments are in the right place. 
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10.B Doing arithmetic with complex numbers 

When we do arithmetic with these numbers, we assume that each separate part of them, the 
real and the imaginary, behaves within itself according to the usual rules for numbers. The 
only extra property is that these two tracks of numbers are not forever running beside each 
other with no communication; every time we get a j 2 we get a cross-over from the imaginary 
to the real. In fact, any even power of j will do this. This makes their possibilities much more 
interesting. 

lO.B.(a) Addition and subtraction 

Suppose we have two complex numbers z x = 3 + j and z 2 = 2 + 5j. 

Since each number is made up of two separate independent measurements, we add the 
numbers by adding each of these pairs, giving us (3 + j) + (2 + 5j) = 5 + 6j. 

The order of addition will not matter because (2 + 5j) + (3 +j) = 5 + 6/ also. 

We can show this addition on an Argand diagram if we allow ourselves to shift the second 
number. (See Figure 1 O.B.I (a).) 




Figure 10.B.1 



The two separate displacements of (2 + 5j) and (3 + j) add together to give the single 
displacement of (5 + 6/). We see that the final result is the same whichever order we do the 
addition in - we just get there by a different route. These two different routes put together 
make a parallelogram. 



To add two complex numbers together, we add the real parts and the imaginary 
parts separately. 

For example, (3 - 2j) + (5 + 4j) = 8 + 2j. 



Subtracting complex numbers works equally easily. 



To subtract two complex numbers we subtract the real parts and the imaginary 
parts separately. 

For example, (4 + 3j) - (1 - 2j) = 3 + 5j. 
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If we have any three complex numbers z, z x and z 2 so that z = z x + z 2 , and we draw them 
on an Argand diagram, we can see a useful relationship between their lengths. I show this 
in Figure lO.B.l(b) above. 

Since the length of the third side of a triangle must be shorter than the lengths of the other 
two sides added together (unless the triangle is squashed completely flat, when side (3) = 
side (1) + side (2)), we have the following result. 



If z = z x + z 2 then |z| < \z x \ + |z 2 |, that is, \z x + z 2 | < \z x \ + |z 2 |. 



We could show the addition of any quantity of complex numbers on an Argand diagram. 
The resulting final displacement will give the sum of the complex numbers. 

I show an example of this in Figure 10.B.2 where the sum of the complex numbers is 
zero. We have (4 + j) + (2 + 3j) + (- 3 + If) + (- 3 - 6j) = (0 + Of). 




Figure 10.B.2 



This is also the way in which the addition of vectors works. Vectors are quantities which 
have both magnitude and direction. Complex numbers have direction built into them because 
of their structure of two separate parts written in a particular order. 

exercise io.B.1 If z t = 3 + 5/ and z 2 = - 2 + 2/ and z 3 = - 7 - 2/ find the following. 

(1) z a + z 2 (2) z 1 + z 2 + z 3 (3) z 2 - z 3 (4) z 3 - z a 

10.B.(b) Multiplication of complex numbers 

Multiplying by a positive number or scalar 

If a complex number is multiplied by a positive number, the effect is to change its size by 
the scale given by this number. 
For example, 

3(2 + J) = 6 + 3j and \{A + 2j) = 2 +j. 

I show these in Figure 10.B.3. The numbers 3 and \ have no direction themselves. They 
simply enlarge or shrink the two complex numbers. 
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Figure 10.B.3 



Multiplying two complex numbers together 

Again we use the ordinary rules of arithmetic on the separate parts of the two numbers. 



To multiply two complex numbers together we multiply out the brackets in the usual 
way except that we replace y 2 by - 1 . 
For example, 

(3 + 4j) X (12 + 5j) = 36 + 48/ + 15/ + 20j 2 = 16 + 63/. 



The above example does not seem to show any particular pattern, but there are two very 
nice links between the three numbers z x = (3 + 4j), z 2 = (12 + 5j) and z x z 2 = 16 + 63/. 

Work out for yourself the values of \z x |, |z 2 | and |ziz 2 |> an d see if you can spot the first 
link. 



You should have \z x \ = ^3 2 + 4 2 = 5 and \z 2 \ = V 12 2 + 5 2 = 13 and Iz^l = Vl6 2 + 63 2 
= 65 so \z x \ |z 2 | = |z! z 2 |. 

Now work out for yourself arg z x , arg z 2 and arg(z x z 2 ) and again see if you can spot a 
link between them. 

_ l 4 _ l 5 

You should have arg z x = tan - (3) = 0.927 rad to 3 d.p. and arg z 2 = tan~ (j^) = 0.395 
rad to 3 d.p. and arg^ z 2 ) = tan _1 (f|) = 1.322 rad to 3 d.p. 
It looks as though arg z x + arg z 2 = arg (z x z 2 ). 
We can see that this is exactly so by using the rule 

tan A + tan B 

Xw(A + B) = from Section 5.D.(c). 

1 - tan A tan B 

4 5 

Putting A = argzj and B = arg z 2 we get tan A = 3 and tan B = j^. So 

I + H 48 + 15 63 

tan(arg z x + arg z 2 ) = — f— = = — = tan(arg(zj z 2 )) 

1 -(f)(n) 36 - 20 16 

therefore 

arg(z!Z 2 ) = arg z x + arg z 2 . 

Multiplying complex numbers in mod/arg form 

Is the result which we have just found above a special coincidence because of our particular 
choice of z x and z 2 or is there a reason why they should behave like this? 
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We show that there is an underlying reason for this result by taking any two complex 
numbers z x and z 2 and writing them in modulus/argument form. 

To do this, we let \z x \ = r x and arg z x = 6 X so z x = r 1 (cos 6 X + j sin 9 X ), and \z 2 \ = r 2 and 
arg z 2 = 0 2 so z 2 = r 2 (cos 0 2 + j sin 9 2 ). 

Then 

z \ z i = r \ r 2 ( cos &i cos #2 + j 2 sin #i sin $2 + 7 srn #i cos #2 + 7 cos ®i srn #2)- 

= (cos 9 X cos 6> 2 _ s i n #1 sin #2) + j (sin #1 cos $2 + cos #i sin #2)- 

We can replace the real and imaginary parts of the inside of this bracket using the double 
angle rules of Section 5.D.(b). This gives us 



z \ z 2 = r \ r 2 (cos(0! + 9 2 ) + j sin(9 x + 9 2 )). 



We now have a very neat way of working out the result of multiplying two complex 
numbers together. 



To multiply two complex numbers together we multiply their moduli and add their 
arguments. 



As an example, we'll take the two roots z x = 4 + 3/ and z 2 = 4 - 3j of the quadratic 
equation z 2 - 8z + 25 = 0. Multiplying them together, we get 

z x z 2 = (4 + 3/) (4 - 3j) = 16 + 12/ - 12/ - 9j 2 = 25. 

We also know from Section 10.A.(c) that \z x \ = \z 2 \ = 5 and arg Zj = 0.644 radians while 
arg z 2 = -0.644 radians. 

Multiplying the two moduli gives us 25, and adding the two arguments, taking account 
of their signs, gives us zero, so both ways of multiplying give us the same answer which is 
the real number of 25. 

Notice that this answer agrees with the rule of Section 2.D.(e) which says that the product 
of the roots of a quadratic equation is given by da. (Here, c = 25 and a = 1.) 

The sum of the roots also agrees with the rule given there, since (4 + 3j) + (4 - 3j) = 8 
= - b/a. (Here, b = -8.) 

These two rules continue to work for quadratic equations with complex roots for exactly 
the same reason which we found they worked in Section 2.D.(e). The square root of b 2 - 4ac, 
which is where potential problems can arise, cancels out when the roots are added or 
multiplied. 

The geometrical effect of multiplying complex numbers 

It is now very easy for us to see geometrically what happens to complex numbers when they 
are multiplied together. 

In particular, if we return to the equation z 4 = 1 in Section 10.A.(a), and its solutions 
shown in Figure 10.A.2, we see that multiplying j by itself turns it through 90° or jt/2. 
Multiplying this result by j turns it through another right angle. 

A final multiplication by j makes the last turn through 90° to finish at +1. We see 
geometrically that j 4 = +1. 
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The next root of z = -1 turns through 180° when it is multiplied by itself, which brings 
it to +1. Repeating this process takes it through a full circle again to +1. 

Think for yourself what happens when you multiply -j by itself and how it is that 
repeating this makes you end up at +1 for (-j) 4 . 

The results of all the above multiplications keep us on the unit circle because each of 
these roots is of unit length, but multiplying any complex number by j will turn it through 
a right angle. (This is because \j\ = 1 and argy = jt/2.) 

For example, j(4 + 2j) = 4j + 2j 2 = -2 + 4j. I show this pair in Figure 10.B.4(a). 




Figure 10.B.4 



You should know now whether you got the right answer to the thinking point at the end 
of Section 10.A.(a) which asked you where the eight roots of z 8 = 1 are. 

They all lie on the unit circle with angles of jt/4 between them. 

Each of them has a modulus of 1 and there is one eighth of a full turn between each pair 
of roots so their arguments are 0, Jt/4, jt/2, 3jt/4, jt, -jt/4, -jt/2 and -3tt/4. 

If you haven't yet made your own drawing of these, draw them in now on the unit circle 
shown above in Figure 10.B.4(b) in a different colour to make them show up. Think how 
each of them will turn as it is multiplied by itself. 



exercise io.b.2 (1) If z 1 = 3 + 27, z 2 = - 2 + / and z 3 = -3 - 5/ find the following. 

(a) 3z a (b) 2z a + 3z 2 (c) 2z r + z 2 + z 3 (d) 3z 2 - 4z 3 
(2) (a) If w = z 2 find w for the following values of z: 

(i) z = 2 + / (ii) z = 4 + / (iii) z = 4 + 3/. 

In each case, also find |z| and |w| and show w on an Argand diagram. 

(b) If z = x + jy and w = u + jv, find |z|, \w\, u and v each in terms of x and y. 
If x and y are whole numbers, which of |z|, |w|, u and v must also be whole 
numbers? 

(c) Each of the Argand diagrams from (a) will show you a right-angled triangle 
with sides whose lengths are whole numbers since, in each case, u 
and v are all whole numbers. 

Sets of whole numbers which give the sides of right-angled triangles 
are called Pythagorean triples because they fit Pythagoras' Theorem. 

Two examples are 3, 4, 5 and 5, 12, 13 because 5* = 4 2 + 3 2 and 13 2 = 
12 2 + 5 2 . 

It is possible to get some very strange shapes of triangle with sides 
given by Pythagorean triples. 
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Try finding w = (12 + 5/') 2 . Then find \w\ and draw a sketch to show w 
on an Argand diagram. 

Now use your result for w to find W = w 2 = (12 + 5/') 4 . Find | and 
then show W on a new Argand diagram. 

You could also try finding some new triangles like this for yourself. You 
can check if your working is correct by seeing if the lengths of the sides 
do fit Pythagoras' Theorem. 

(3) Check that the pairs of roots which you found for the four equations in 
Exercise 10.A.1 do fit the equations by substituting them back in. Then find the 
sum and the product for each of these pairs of roots and check that they do 
come to '-b/a' and 'c/a' respectively in each case. 

(4) One of the roots of the equation z 3 = 1 is z = 1. 

(a) Draw a little sketch showing all three roots of this equation. 

(b) Find these three roots giving them both in mod/arg form and in a + bj 
form. 

(c) These three roots are known as 1, m and co 2 , as we move round 
anticlockwise. Check that co X co gives to 2 and that co 2 X a> 2 gives co. 




Can you work out where the four roots of z 4 = -1 will be, drawing them on 
a little sketch and giving each of them both in mod/arg form and in a + bj 
form? 



lO.B.(c) Dividing complex numbers in mod/arg form 

We can get the rules for this by thinking of dividing as a rearranged multiplying. 

Just as with ordinary numbers, we can say (z 1 /z 2 ) z 2 = z x so, from the rules for 
multiplying, we get the following pair of results. 











hi 






kl = hl so 








^2 




^2 


M 


and 










arg(zi/z 2 ) + arg(z 2 ) = arg(zj) so arg( Zl /z 2 ) = arg(zj) - arg(z 2 ). 



To divide two complex numbers we divide their moduli and subtract their 
arguments. 



^^^^ Notice that, just as with ordinary numbers, the order matters here, unlike 
^B-^V when we multiply. z l lz 2 is not the same as z 2 /z,, so you must be careful 
when you work out the new modulus and argument. 
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As an example of using these rules, we'll find (1 + j) (1 -j). I have shown these two 
complex numbers on the Argand diagram in Figure 10.B.5. 




Using this, we can easily write the two complex numbers involved in this division in 
mod/arg form, that is, in the form z = [r, 0] where \z\ = r and arg z = 6. 
We have 



l+y | x 2. .t4| 


\[2 71 / 






71 

}'2_ 


1 -j [V2, - Jt/4] 


J2' 4 \ 







=j- 



lO.B.(d) What are complex conjugates? 

In Section 10A.(b) we found that the roots of the equation z 2 - 8z + 25 = 0 are z x = 4 + 3/ 
and z 2 = 4 - 3j. We saw in the Argand diagram of Figure 10A.3 that they formed a 
symmetrical pair either side of the real axis. 



Pairs of numbers which can be written in the form a + bj and a - bj are called 
complex conjugates. 

If z = a + bj then its conjugate of a - bj is written as z. 
The conjugate of a - bj is a + bj. 



'Conjugates' means a sort of married couple - in fact the word stems from this original 
meaning. 



You will 


see that since 


|z| = ja 2 + b 2 


and |z| = 


Va 2 + (-6) 2 it must be true that 


\A = kl- 

Also, 










zz 


= {a + bj)(a - 


bj) = a 2 + b 2 


= (N) 2 - 


k 2 | 


because 










k 2 


= \{a + bj) 2 | 


= |(a 2 - b 2 ) + 


2aty| = \ 


6 2 ) 2 + 4a 2 6 2 = a 2 + 6 2 . 



You can also see that this must be so from the way in which they lie on the Argand diagram, 
z is the reflection of z in the real axis, so it must be the same length. 
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EXERCISE 10.B.3 



I have shown five complex numbers in Figure 10.B.6. Draw in their conjugates for 
yourself, labelling each one so that you can check if you have put them in the 
right places. 




>(Ee 



Figure 10.B.6 



lO.B.(e) Using complex conjugates to simplify fractions 

Complex conjugates are important in many areas. One of their uses is in simplifying 
fractions which have complex numbers underneath. Suppose we have the fraction 

3+7 
5-2/ 

There is a very neat way of simplifying fractions like this, in which we convert the bottom 
of the fraction (its denominator) into a real number. 

Can you see how we can do this? (We have already used this method for simplifying 
fractions with other square roots underneath in Section l.F.(d). The only difference here is 
that now we are dealing with j, the special square root of — 1.) 



The trick is to multiply the bottom of the fraction by its conjugate. (Of course, in this case 
we must also multiply the top of the fraction by the same thing to keep its whole value 
unchanged.) This gives us 

(3+y) (5 + 2/) _ 15 + 2j 2 + 5j + 6/ _ 13 + Uj 

(5 - 2j) (5 + 2j) 25 - 4j 2 - 10/ + 10/ 29 

This is now in the very much more convenient form of the single complex number 
29 (13 + 11/). What we have done here is, in effect, a division. 
We have found that (3 +f) (5 - 2j) = ^ (13 + Uj). 



exercise 10.B.4 Try simplifying these yourself. 



3 4 1+7 1-2/ 

(1) — (2) (3) — (4) 

2+7 3-27 1-7 5 + 37 

2-7 

(5) (Multiply out the bottom first.) 

(3 +7') (3 - 27) 

3 2 

(6) + (Simplify each separate fraction first, and then add.) 

2-7 3 + 2/ 
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10.C How e connects with complex numbers 

lO.C.(a) Two for the price of one - equating real and imaginary parts 

Because of their structure, complex numbers have a special property which is of enormous 
importance. 



Two complex numbers are equal if and only if each of their real and imaginary 
parts is separately equal. 



We can show that this is true in the following way. 

Let z 1 = a + bj and z 2 = c + dj. Then, if a = c and b = d, it is certainly true that z x =z 2 . 
Now we have to show that if z x = z 2 then a = c and b = d. 

We can see that this must be true geometrically because complex numbers have direction 
as well as length, and therefore the only way that two of them can be equal, and so lie exactly 
on top of each other, is if their real and imaginary parts are separately equal. 

We can also see this rather nicely in the following way, using algebra. 

If Zj = z 2 then a + bj = c + dj so 

a-c = dj -bj =j(d-b). 
Squaring both sides of this equation gives 
(a - cf = j\d - bf 

so 

(a - cf = -(d - bf. 
Therefore 

(a - c f + (d- b f = 0. 

But remember that a, b, c, and d are all real numbers. Because of this, we can say that 
(a - c) 2 > 0 and (d - b f > 0. 

Therefore, the only way for it to be possible that (a - cf + (d - bf = 0 is that each of 
{a - c) and (d - b) are equal to zero. Therefore a = c and d = b. 

This property of complex numbers is of huge importance in their application to physical 
situations, because it means that any equation involving complex numbers is actually made 
up of two separate equations. We are, in a sense, getting two for the price of one. 

example (i) To see an example of this in action, we will solve the equation z 2 = 5 + 12/. 

Let z = a + bj. (We know the solution must be complex because its 
square is a complex number.) 
We have (a + bjf = 5 + 12/ so 

a 2 - b 2 + labj = 5 + 12/. 

This can only be true if both the real parts and the imaginary parts are 
separately equal. 

Equating the real parts gives 

a 2 -b 2 = 5 (1) 

Equating the imaginary parts gives 

lab = 12 (2) 
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From (2) we have a = 12/26 = 61b. Substituting this in equation (1) 
gives 

36 

— -b 2 = 5 so 36-b 4 = 5b 2 so b 4 + 5b 2 - 36 = 0. 
b 2 

This is much easier to solve than it looks at first, being one of those 
quadratic equations which are masquerading as something much nastier 
which we met in Example (4) in Section 2.E.(d). Factorising this 
equation gives (b 2 - 4)(b 2 + 9) = 0, (or you could put b 2 = y, say, and 
then use the formula). This gives us b 2 = 4 or b 2 = -9. 

Because b is a real number, we discard b 2 = -9 as it will only give 
us imaginary solutions. This leaves us with the two solutions of b = +2 
or b = -2. 

The corresponding answers for a are a = 6/2 = 3 or a = 6/-2 = -3 . 
So the two solutions of the equation z 2 = 5 + 12/ are z l = 3 + 2j and 
z 2 = -3 - 2j. Check for yourself that they really work! 
I show these two solutions in Figure 1 0.C.1 (a). 




Figure 10.C.1 



Geometrically, we can see straight away that z x is a solution of z 2 = 5 + 12/. 
This is because \z 2 \ = 13 and \z x \ = /l3, and also 

arg z x = tan _1 (§) and arg(z 2 ) = tan _1 (y) so arg z x = \ (arg z 2 ). 

(Check this numerically on your calculator.) 

We know from the rules for complex multiplication given in Section 10.B.(b) that \z x \ = 
\ z t\ = \/M- We also know that 2 arg z 1 = 2 arg z 2 = arg(z 2 ). 

In Figure lO.B.l.(b), you can see how these angles actually work. Doubling both arg z x 
and arg z 2 brings you round to the direction of z 2 . (Since arg z 2 is negative, doubling it will 
mean that you are moving in a clockwise direction about the origin.) 

The two square roots of a complex number will always be in the form ±(x + jy), so 
together they will always make a straight line on their Argand diagram. 

We shall need to be able to find these square roots when we solve quadratic equations 
with complex coefficients in Section 10.D.(b). 
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exercise io.c.1 (1) Find the square roots of each of the following complex numbers in a + bj form 

and show each of these pairs of roots on an Argand diagram, 
(a) 3 + 4/ (b) 15 + 8/ (c) 5 - 12/ 
(2) Find the modulus and argument of each of the given complex numbers and 
sketch each of them on its own Argand diagram. Then use this to help you to 
find both of the square roots of each of the given numbers, showing them also 
on the Argand diagrams, and giving your answers in mod/arg form, that is, in 
the form z = [r, 6] where r = |z| and 0 = arg z. 
(a) 4/ (b) 1 + V37 (c) -V2 + yfzj (d) -1 -V3/ 



lO.C.(b) How does e get involved? 

At last we are able to solve the mystery of the resemblance between the trigonometrical 
functions cos t and sin t and the hyperbolic functions cosh t and sinh t. 

To do this, we will need to use the series which we found in Section 8.G. I list below the 
ones which we shall need. 

t t 2 t 3 t 4 t 5 t 6 
e'=\+ — + — + — + — + — + — +... (1) 
1! 2! 3! 4! 5! 6! 

t 2 t 4 t 6 

cos t= 1 - — + — - — + .. . (2) 
2! 4! 6! 

(3) 
(4) 
(5) 



At the end of Section 8.G, we were left with the feeling that it should also be possible 
to link e', cos t and sin t together, but we couldn't quite do it then. Now we have the extra 
possibilities given by complex numbers. 

What happens if we put t =j0 in series (1)? We shall get 

je fe 2 fe 3 fe 4 fe 5 fe b 

e je = 1 + J _ + t + { + t + t + t + _ _ 

1! 2! 3! 4! 5! 6! 

je e 2 je 3 e 4 je 5 e 6 

so e> e =\+ — - — - — + — + — - — + ... (6) 
1! 2! 3! 4! 5! 6! 

Now answer these questions yourself. 

(1) Pick out the real parts in series (6) above to give a new series. What do you 
get? 

(2) Make a new series with just the imaginary parts of the series above. Can you see 
what doing this will give you? 

(3) What linking relationship have you now found? 





t t 3 


t 5 


f 


sin t = 






- — + 




1! 3! 


5! 


7! 




t 2 


t 4 


t 6 


cosh t = 


= 1 + — + 




+ — + 




2! 


4! 


6! 




t t 3 


t 5 




sinh t = 


-- — + — -\ 


- — 


- + . . . 




1! 3! 


51 




We know that e* 


= cosh t + 


sinh t. 
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This is what you should have. 

The real parts of series (6) give the series for cos 9. 

The imaginary parts of series (6) give the series for sin 9, so if we include the js too we 
have the series for j sin 9. 

(Note: we are assuming here that it is all right to play around with these series in this way. 
In these particular cases, mathematicians have shown that it is all right but this is by no 
means a general rule. Infinite series have to be treated with great caution, as we saw for 
ourselves in Section 6.F.) 

Putting together what we now have, we finally get our link, which is 



e> = cos 9 +j sin 9. 



This is an amazing and beautiful result, and is due to the Swiss mathematician Euler after 
whom e is named. 



lO.C.(c) What is the geometrical meaning of z = e jV ? 

Since z = e' e = cos 9 +j sin 9 we can see from Figure 10.C.2(a) that z must lie on a circle 
whose centre is at the origin and whose radius is one unit. This circle is known as the unit 
circle about the origin. 




For any value of 9, the point representing z will be somewhere on this unit circle. 
Everywhere on this circle, |z| = 1 and arg z = 9. 

To get the feel of what is happening, we'll look at some particular possibilities for z as 
9 varies. 

As a first example, if 9 r = 7t/4, we get the point z 1 = cos(jt/4) + j sin(jt/4) = 
(l/>/2) + (l/v2)/- I have marked z x on Figure 10.C.2(b). 

What points do you get if you take (a) 9 2 = 0 (b) 9 3 = ji (c) 9 4 = nil (d) 9 5 = -nil 
(e) 9 6 = 2jt/3? Mark each of the corresponding z values on Figure 10.C.2(b). 
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You should have the following points. 

(a) z 2 = e° = 1, (b) z 3 = e jJt = -1 because e' 71 = cos n + y'sin n. 

The result e 7 ^ = -1 is also due to Euler and is known as Euler's Formula. It links three 
extraordinary numbers, e, it and j, to give the simple answer of-1. 

Similarly; (c) z 4 = e>* a = j, (d) z 5 = e~ J ' t/2 = -j, and (e) z 6 = e 2j3t ' 3 = cos 2jt/3 + j sin 2jt/3 
= -1/2 + (V372)y. 

As 6 increases, z = e' e is turning anticlockwise round the unit circle. 

lO.C.(d) What is e~'° and what does it do geometrically? 

We know that e> 6 = cos 9 +j sin 0. If we put -0 instead of 0, we will get 

e~ jd = cos(- 6) + j sin(- 9). 

From Section 5.A.(c) we know that cos(-6>) = cos 9 and sin(-#) = -sin 9 using the turn 
of P on its unit circle. This gives us 



e je = cos 9 - j sin 9. 



What will we get if we plot z = e~ J on an Argand diagram? 

If we start with 9 r = ji/4 as before, and let zf = e T/jr/4 , using the * to distinguish it from 
the previous zj , we shall have zf = cos jr/4 -y sin jt/4 = 1//2 - {\l^2)j. 

I have shown the turn to 9 X in Figure 10.C.3(a), and the position of zf in Figure 
10.C.3(b). 




-r 



Figure 10.C.3 

Now, using the same values of 9 which you used in Section lO.C.(c), try finding the 
corresponding values for z* yourself. Mark each of your angles on Figure 10. C. 3(a) and each 
of your values for z* on Figure 10.C.3(b). 



You should have zf = 1, zf = -1, z% = -j, zf=j and z% = -1/2 - (^[3/2)j. 

As 9 increases, z = e~' 9 moves clockwise round the unit circle about the origin. This means 
that each z* is the reflection of its corresponding z in the real axis, so each z* = z for that 
particular z. 
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For any given value of 9, e j6 and e je are conjugates of each other. 



If z = e' e then z = e~ je . \z\ = |z| = 1 so (e je )(e^ 9 ) = 1. 

This exactly slots with the result given by multiplying Zj = e' e and z 2 = e~ je and using 
the first rule of powers. 

If zj = e> e and z 2 = e~ je then 

z lZ2 = e' 6 e~ je = e^ e ~ je ) = e° = 1. 

lO.C.(e) A summary of the sin/cos and sinh/cosh links 

We now have the following results: 



e je = cos 0 +j sin 0, 
e~ je = cos 6 -j sin 9. 



From this, 

e /0 + e -je = 2 cos o gi v i n g cos # = I ( e /e + 
and e /e - e _7 ' e = 2/ sin 9 giving j sin 9 = \ (e' 9 - e~ je ). 

If we compare these results with cosh x = \ (e x + e~ x ) and sinh x = \ (e x - e~ x ) and put 
x =j9, we see that 



cosh (j9) 


= cos 9 


and 




sinh(/6>) 


= j sin 9. 



The similar behaviour of the sin/cos and sinh/cosh pairs is no longer mysterious. These 
functions are all intimately linked together and these links explain the differences between 
the rules for trigonometric and hyperbolic functions. Every time we square a sin or sinh as 
part of our working, we get the cross-over from the imaginary to the real from putting 
j 2 = -1. This is the reason for Osborn's Rule which says that the formulas for the 
trigonometrical and the hyperbolic functions are the same except that we must change the 
sign whenever the working involves multiplying two sins or two sinhs together. (We met this 
rule in Section 8.D.(d).) 

Since it was the series expansions which showed us the links between the pairs of cos and 
sin, and cosh and sinh, we can begin to see that it might be rather good to actually define 
these functions by using these series. This may seem alarmingly peculiar when you first 
meet it - after all, it is a very long way away from those original right-angled triangles. 
However, it does have many advantages, not least of which is that it is then possible to define 
what we mean by e z , sin z, cos z, sinh z and cosh z when z is complex. It is also possible to 
show that, if we do define sin and cos by means of series expansions, all the familiar 
properties are true if we consider only the cases for which z is real. 
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lO.C.(f) De Moivre's Theorem 

We know from Section lO.C.(b) that e jd = cos 9 + j sin 9. 

If we put n9 instead of 9, we get e'" e = cos n9 + j sin n9. 

But it can be shown that we can say e'" e = (e> e ) n = (cos 9 + j sin 9)". 

This gives us the following result which is known by the name of the French 
mathematician Abraham De Moivre. 



De Moivre's Theorem 

(cos 9 + j sin 9) n = (cos n9 + j sin n9) 



Here is one example of how we can use this. 
Putting n = 2 gives us 

(cos 9 + j sin 9) 2 = cos 29 + j sin 29 

so cos 2 9 - sin 2 9 + 2j sin 9 cos 9 = cos 29 + j sin 29. 

Now we use the fact that this equation is really two equations - we have two for the price 
of one, as we saw in Section lO.C.(a). 
Equating the real parts gives us 

cos 29 = cos 2 9 - sin 2 9 

Equating the imaginary parts gives us 

sin 29 = 2 sin 9 cos 9. 

We have been able to show both the double angle rules from Section 5.D.(d) very simply. 
Notice the neat way in which using complex numbers gives us both rules from the same 
piece of working! 

lO.C.(g) Another example: writing cos 56 in terms of cos 6 

De Moivre's Theorem makes it possible to find rules for multiple angles with a fraction of 
the work that would have been necessary up to now. As an example of this, I will show you 
how to write cos 59 in terms of cos 9 and sin 59 in terms of sin 9, two results which 
examiners seem to be rather fond of. 

Since we want rules for cos 59 and sin 59, we put n = 5 in De Moivre's Theorem. This 
gives us 

(cos 9 +j sin 9) 5 = (cos 59 + j sin 59). 

To multiply out the left-hand side, we shall have to use a binomial expansion and this 
means that we shall need to know the right binomial coefficients to use. 

^^^^ Don't leave these coefficients out! 

Since n = 5, and 5 is quite a small number, much the easiest way to find the binomial 
coefficients is to use Pascal's Triangle. We wrote this down in Section 7A.(a). 
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The line of coefficients given by the triangle when n = 5 is 
1 5 10 10 5 1 



Because we shall be using Pascal's Triangle a lot in this chapter, here is a brief reminder 
of how it works. Each new line is made from adding the two numbers nearest in the line 
above, unless it is at the end of the line when the single number closest to it is used. So, for 
example, if n = 6, the next line of the triangle would be 

1 6 15 20 15 6 1 

Remember that the triangle starts with 1 1 when n = 1 . 
Putting in the coefficients in our example, we have 

(cos 9+j sin Of = (cos 0) 5 + 5 (cos 0) 4 (j sin 0) + 10 (cos 0) 3 (j sin 0) 2 

+ 10 (cos 0) 2 (j sin 0) 3 + 5 (cos 0) (j sin 0) 4 + (j sin 0) 5 . 

I have written this working out in detail because mistakes often creep in here. In 
particular, notice that it is the whole of (J sin 0) which is being raised to the different 
powers. 

Now we replace j 2 by -l,j 3 by -j etc. and tidy up generally. 
Doing this gives us a right-hand side of 

cos 5 9 + 5j cos 4 9 sin 9 - 10 cos 3 0 sin 2 0-10/ cos 2 9 sin 3 9 + 5 cos 9 sin 4 9 

+ j sin 5 9. 

We also know from De Moivre's Theorem that (cos 9 + j sin 0) 5 = cos 59 + j sin 59. 
Now we equate the real and imaginary parts of this equation, so getting double worth 
from our labours so far. 

Equating the real parts gives us 

cos 59 = cos 5 0-10 cos 3 9 sin 2 9 + 5 cos 9 sin 4 9 (1) 

and equating the imaginary parts gives us 

sin 50 = 5 cos 4 0 sin 0 - 10 cos 2 0 sin 3 0 + sin 5 0. (2) 

Finally, if we want the RHS of equation (1) to be entirely in terms of cos 0 and the RHS 
of equation (2) to be entirely in terms of sin 0, a proviso which examiners frequently make, 
what should we use? 



We use the identity sin 2 0 + cos 2 0 = 1 to do the necessary adjustments. You may be 
wondering how this identity is going to help us with sin 4 0. Can you see how we can 
use it? 



We can say that 

sin 4 0 = (sin 2 0) 2 = (1 - cos 2 9) 2 = 1 + cos 4 0-2 cos 2 0. 
This gives us 

cos 50 = cos 5 0-10 cos 3 0(1- cos 2 0) + 5 cos 0(1 + cos 4 0-2 cos 2 0). 
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Tidying this up, we have 

cos 59 = 16 cos 5 6 - 20 cos 3 9 + 5 cos 9. 

Now do the very similar process for yourself of equating the imaginary parts to show 
that 

sin 59 = 16 sin 5 9 - 20 sin 3 9 + 5 sin 9. 

exercise io.c.2 Try the following questions yourself. 

(1) Show, using De Moivre's Theorem, that 

(a) cos 38 = 4 cos 3 8 - 3 cos 8 and (b) sin 38 = 3 sin 8 - 4 sin 3 8. 

(2) Show that cos 78 = 64 cos 7 0 - 112 cos 5 0 + 56 cos 3 0 - 7 cos 0. 

lO.C.(h) More examples of writing trig functions in different forms 

There is a rather similar problem (also popular with examiners) which we can now solve. 
This is basically a question of going back the other way. 

How, for example, could we write cos 7 9 in terms of cosines of multiples of 91 

To do this, we use the results of Section 10.C.(e) that 

cos 9 = \ (e' e + e~ Jd ) and j sin 9 = \ (e' e - e- je ). 
If we replace 9 by n9 we get 

cos nO = \ (e nje + e~ nje ) and j sin n 9 = \ (e nje - e~ nje \ 
To save some writing, it is handy to let e' 6 = z at this point, so that 

e nj0 = ^0y = z n ^ ^jO = ^ = 

We can then rewrite the above pair of results as follows: 



lfe' e 


= z, then cos t 


) = \{z+ 1/z) 


and j sin 9 


= \(z- l/z). 


Also, 


cos n9 = 5 (z n + 


1/z") and j 


sin n9 = \ (z" 


- 1/z"). 



So how will we now set about writing cos 7 9 in terms of cosines of multiples of 91 We can 
say 

cos 7 9 = [5 (z + 1/z)] 7 = (|) 7 (z + 1/z) 7 . 

Next, we use Pascal's Triangle to get the binomial coefficients for the expansion of 
(z + 1/z) 7 . The line for n = 7 comes immediately from the line for n = 6 which we wrote down 
in the previous section. This was 

1 6 15 20 15 6 1 
So, when n = 7 we get 

1 7 21 35 35 21 7 1 
Using these coefficients gives us 

(j) 7 (z + 1/z) 7 = (j) 7 [z 7 + 7(z) 6 (1/z) + 21(z) 5 (1/z) 2 + 35(z) 4 (1/z) 3 

+ 35 (z) 3 (1/z) 4 + 21 (z) 2 (1/z) 5 + 7(z)(l/z) 6 + (1/z) 7 ]. 
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This tidies up very neatly as 

(j) 7 (z + 1/z) 7 = (|) 7 [(z 7 + 1/z 7 ) + 7(z 5 + 1/z 5 ) + 21(z 3 + 1/z 3 ) + 35(z + 1/z)]. 

Now all that remains to be done is to use cos n9 = \ (z" + 1/z") here. This gives us 

cos 7 9 = (|) 7 (2 cos 19 + 14 cos 59 + 42 cos 39 + 70 cos 9). 

= (|) 6 (cos 19 + 1 cos 59 + 21 cos 39 + 35 cos 9). 

If you want sin 7 9 in terms of multiples of sin 9, you just start with 

(j sin 9) 1 = (\) 7 (z - 1/z) 7 , 

and then work in a very similar way. You will find that the js will all cancel out. 

You can also get slightly quirky extra results by differentiating a previous result. 
For example, if we differentiate the expression which we have just found for cos 7 9 with 
respect to 9, we get 

-7 cos 6 9 sin 9 = (j) 6 (-7 sin 19 - 35 sin 59 - 63 sin 39 - 35 sin 9) 

so cos 6 9 sin 9 = {\f (sin 19 + 5 sin 59 + 9 sin 30 + 5 sin 9). 

(Examiners rather like this one, too.) 

exercise io.c.3 (l) Show that cos 5 8 = (j) 4 (cos 5# + 5 cos 38 + 10 cos 8). 

(2) By rewriting 0' sin 8) 5 in a different form, show that 
sin 5 8 = (j) 4 (sin 56-5 sin 38 + 10 sin 8). 



lO.C.(i) Solving a differential equation which describes SHM 

We can now see that the method for solving differential equations described in Section 
9.C.(c) and questions (9) and (10) of Exercise 8.C.4 will also work for the equation 
d 2 xldt 2 = -x. (Newton's dot notation is quite often used for this kind of equation. We write 
d 2 x/dt 2 as x and dxldt as x. The differential equation then becomes 'x = -x.) 

We will find the solution for the particular case when we also know that x = 3 and 
dxldt = 4 when t = 0. We have 

d 2 x _ 

~de~~ x 

and we try putting x = e kt . This gives us 

dx d 2 x 

— = ke kt and = k 2 e . 

dt dt 2 

If x = e kt is a solution of the given equation then we must have k 2 e k> = -e k> or 
(k 2 + 1) e kt = 0. 

Since e fa is never equal to zero, we would have to have k 2 = - 1 giving us k =j ork = -j. 
Making the solution as general as possible gives us x = Ae'' + Be''* where A and B are 
constants. 

Now we use Sections 10.C.(b) and (d) to say that 

e' f = cos t + j sin t and e~ Jt = cos t - j sin t. 
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This gives us 



x = A (cos t + j sin t) + B (cos t—j sin t) 

so x = (A + B) cos t + (A - B)j sin t. 

If we let C = J + 5 and D=j (A- B), we have 

x = C cos £ + D sin f. 

We are told that x = 3 when ^ = 0 so 3 = C. 
Also, we know that dxldt = 4 when t = 0. 
Now, 

dx 

— = - C sin f + D cos t so 4 = D. 
dt 

This gives us the solution 

x = 3 cos ? + 4 sin 

Examples (a), (c) and (d) in Section 9.C.(c) also all give solutions for x from the 
differential equation d 2 xldt z = -x. Each of these solutions is different because the values of 
x and dxldt when t = 0 are different in each case. 

I showed in Section 5.D.(f) that 3 cos t + 4 sin t can also be written as 5 sin (t + a) with 
a = tan -1 4. You can see on Figure 5.D.3(a) how x changes with t. 

This solution fits the general solution of an equation like this of x =A sin (cot + e) if we 
put A = 5, a) = 1 and e = tan -1 4. 

Now try solving the equation d 2 x/dt 2 = -9x for yourself, given that we also know that x 
= 0 and dxldt = 6 when t = 0. 



This time, putting x = e k> gives us k 2 + 9 = 0 so & = ±3/'. 

We now have the solution x =^e 3/Y + Be~ 3Jt . 

Using 

e 3/Y = cos 3? + 7 sin 3t and e~ 3/Y = cos 3t - j sin 3? 

gives us 

x = (A+ B) cos 3? +7 (A - B) sin 3* = C cos 3* + D sin 3* 
with C = A + B and D=j (A - B). 

Also, x = 0 when t = 0 so 0 = C, and dx/cfr = 3Z) cos 3? = 6 when t = 0 so D = 2. 

This gives us the solution that x = 2 sin 3t. I showed what this particular x looks like in 
the first example at the end of Section 5.C.(b). 

Comparing this solution with x = A sin (cot + £), we have A = 2, co = 3 and £ = 0. 

It is also possible to write the general SHM solution in the form x = A cos (cot + e). In 
this particular case, this would give us x = 2 cos (3t - nil). 

10.C.(j) A first look at how we can use complex numbers to describe electric circuits 

In Section 5.D.(g) we said that the combined effect of particular components in electric 
circuits, such as inductors, capacitors or resistors, can have the effect of making the current 
and voltage be out of phase with each other if there is an alternating electromotive force 
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(e.m.f.). This phase difference will continue to be present even when the circuit has settled 
down after being switched on. (Initially there will also be transient responses, but these will 
die away leaving the circuit in a steady state.) 

Suppose that the current in a single branch of a circuit network is given by 
/ = v 3 cos t - sin t and that the frequency is constant throughout the circuit. 

At a junction, we know from Kirchhoff 's junction rule that what goes in must come 
out, or that the net current flowing into a junction must be equal to the net current 
flowing out of it. We are supposing that the other currents involved at this junction have 
the same frequency, but they could certainly have different amplitudes and phase 
constants. Therefore we would be faced with having to combine a string of terms like 
2 cos (t + jt/6), 3 cos (t - ji/4) etc. 

Adding these in the form of trig functions is unpleasantly complicated, but we can use the 
properties of complex numbers to make things much simpler. 

Here's how it would work in the case above. 

From Section lO.C.(b), we know that e' 9 = cos 9 +j sin 9. 

We have also seen there that, as 9 increases, e j d moves round the unit circle. 

If we call the variable t instead of 9, and let t represent time, then z = e jt will have moved 
right round the unit circle once in 2tc seconds. 

How can we now use this to help us describe the particular current which we started with, 

1 = yf3 cos t - sin f? 

Using the methods of Section 5.D.(g) we can rewrite v'3 cos t - sin t in the form 

2 cos (t + 7t/6). (This is, in fact, the answer to the first question of Exercise 5.D.1 at the end 
of that section.) 

As we saw there, we can then use the motion of P on its circle to help us to see how the 
wave function of the current behaves. I show this again below in Figure 10.C.4. 




X Can ritprtstnt fcht currewt 1= 2. ce»s(t + 5fc} 
Figure 10.C.4 



Now here is the step which links all the work which we were doing there with our 
present work on complex numbers, and which makes it possible for us to avoid a lot of 
tedious trig. 

We can think of the current 1=2 cos (t + jt/6) as being the real part of the complex 
number 2e /(f + jr/6) , or 2 exp[/(r + jt/6)] as this is sometimes written to avoid cramped-up 
complicated powers. 
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We can also use the first rule of powers to say that 

2 exp[/0 + jt/6)] = 2 expO'O exp(/7t/6) or (2e jJt,6 )(e jt ). 

(We know from the rule for multiplying two complex numbers that we must multiply 
their moduli and add their arguments.) 

Now, 2e JJt/6 has a modulus of 2 (the radius of the circle), and an argument of jt/6. It tells 
us the starting position of P and is called a phasor. 

e j> has a modulus of 1 and an argument of t, and this argument tells us how far P has 
turned after a time of / seconds. 

Multiplying the two together has the effect of using the e'* to drive P round the circle of 
radius two units. P starts from 2e** 16 , and one full cycle takes 2jt seconds. 

One benefit of this is that, if we have all the currents at a junction represented by the real 
parts of complex numbers, we will be able to add these currents by adding the complex 
numbers. We know that the real parts will remain separate from the imaginary parts so, at 
any stage of the working, the real part will still represent current. There are other advantages 
too, which you will discover if you are studying the theory of what is happening in these 
circuits as part of your other courses. 

If the currents are given as sin functions, it is equally possible to represent them by the 
imaginary parts of complex numbers. The only difference would be that they would be being 
represented on the vertical or imaginary axis instead of the horizontal or real axis. 

Also, everything would work equally well with a different common frequency for a 
circuit. So, for example, if we had started with the particular current of / = 4 cos (3? + tt/8), 
then we would use the real part of the complex number 4 exp[/' (3? + tt/8)] to represent this. 
The starting point of P would then be represented by the complex number 4e** /8 , and the 
e 3jt would then drive P round the circle as time passes. A full cycle would take 2tt:/3 seconds. 
I show this in Figure 10.C.5 below. 




after time t 

START when t=© ^I^n btj 0\t 

Complex number 4 e- . 

This is the phasor 



x represents 1 = 4 <»s(St+ 1 ^si) 
wh»'ch is fte teal part of (4e? 1 *Xe? 3t ) 



Figure 10.C.5 



10.D Using complex numbers to solve more equations 

lO.D.(a) Finding the n roots of z" = a + bj 

We have already found solutions to some particular examples of this kind of equation, such 
as z 4 = 1 at the beginning of the chapter, and z 3 = 1 in question (4) of Exercise 10.B.2. We 
did this by thinking how the turns should go to make the answers fit in with the rules for 
multiplying complex numbers. 
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In this section, we look for some general rules which will help us to solve any equation 
of this kind. We do this by using the relationship e' e = cos 6+j sin 9 which makes it possible 
for us to work out the answers for the n roots of the equation z" = a + bj in a very simple 
way. I shall use the thinking point from Exercise 10.B.2 at the end of Section lO.B.(c) to 
explain how this works. I asked you there if you could find the four roots of z 4 = -1, so you 
will also be able to see now if your answers were right. 

We start by noticing that \z 4 \ = 1, because z 4 = -1. 

We also know that arg(z 4 ) = ji. 

I show a sketch of z 4 marked on the the unit circle in Figure 1 O.D.I (a). 




Figure 10.D.1 



Geometrically, we can see that z x is a root since \z x | = 1 and arg (z \ ) = 4 arg z x = it. The 
other three roots must also all have a modulus of 1 since, when multiplied by themselves, 
they give an answer with a modulus of 1 . Because of this, all the roots will lie on the unit 
circle. They will be given by successive quarter turns from z x , since four times any quarter 
turn makes a full turn. I have shown the four roots on Figure lO.D.l(b). 

If we take z 2 , for example, we will have arg(z 2 ) = jt/4 + jt/2 giving arg(z 4 ) = 4(jt/4 + nil) 
= Ji + 2tc using the rule for multiplying complex numbers from Section lO.B.(b). The extra 
full turn makes no difference to the point where we finish up, so arg(z 4 ) = Jt. 

Now we see how all this fits in with using e' e = cosO + j sin 9. 

We have 

z 4 = -1 = cos (jt) + j sin (tt) = e' n 

so Zl = (e j7l ) m = e jM = cos(tt/4) + j sin (tt/4). 

We can get all the roots by writing the arg of z 4 so that it includes the possibility of any 
whole number of extra full turns. 

If we say arg(z 4 ) = it + 2kn, where k is any whole number, then 

z 4 = e i(n + 2/cjt) SQ z = ^(pt + 2for))l/4 = ^(ji + 2for)M 

This power of e is rather cramped and difficult to read. Often such powers are written 
using exp instead of e so, for example, e jd would be written as exp(j9). Here we would 
have 



z = exp 



(ji + 2kn) 
j ; 
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Putting k = 0, 1,-1, and -2 gives the four distinct roots of exp(jjz/4), exp[/(3jr/4)], 
exp[/(-jr/4)], and exp[/(-3jr/4)]. These are the four roots z l5 z 2 , z 3 and z 4 which I have 
shown in Figure 10.D.l(b) above. There is one quarter of a full turn between each root 
and the next one, and any other value of k will give you one of the roots which you 
already have. 

In exactly the same way, we can make a general rule for solving any equation of the form 

z" = a + bj. 

Let z n = w, so \z"\ = \w\ and arg(z") = arg w = 9 say. 

Then, just as we did in the previous example, we can say that arg z n = 6 + 2kji where k 
is any integer (that is, whole number), since adding any number of full turns leaves the 
argument unchanged. 

So now we have 

w = a + bj = \w\ (cos# + j sin#) = \w\ e je = \w\ exp[j(9 + 2ht)] 
Therefore 

z = w l/ " = (\w\ exp\j(6 + 2ki)] l,n = \w\ 1/n exp 
We summarise this result as follows: 



(0 + 2hx) 



A general rule for finding roots 

{6 + 2kji)~ 

j 

n 

where the n distinct roots are given by the n different integer values of k which 
make the arguments lie in the range -71 < arg z < 71. 



\iz n = w then z = \w\ lln exp 



You should always draw a sketch of the roots on an Argand diagram so that 
you choose the right values for k. 



example (l) We will now find the eight distinct roots of z 8 = 8 + 8>/3/ using the 
above result. 

I let z 8 = w, and start by drawing the sketch of w shown in Figure 
10.D.2(a). We have 



|w| = v/8 2 + (8V3) 2 = ^256 = 16 

and 

arg w = tan = tan V 3 = — . 

V 8 / 3 

Using the symmetry of the eight roots, I now draw the sketch of where 
they will be in Figure 10.D.2(b). I am using arg z x = g arg w. 

Since we can add any number of full turns to arg w without altering 
it, we can say arg w = 2krc + it/3 where k is any whole number. 
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Figure 10.D.2 



Therefore 

w = 16[cos(2&7r + jc/3) + j sm{2kn + jt/3)] 
= 16 exp \J(2kn + jt/3)]. 
So, letting z 8 = w, we can say 

'2kjx + jxI3 s 



z = w 1/8 = (16) 1/8 exp 



= V2 



exp 



6fcr + it 
24 



From the sketch, we can see that we should put k = 0, 1, 2, 3, -1, -2, 
-3 and -4. 

Equally, you could find the solutions to z 8 = 8 + 8^3/ by just using the geometry of your 
sketch. 

You know that |z 8 | = 16 and arg z 8 = jt/3 so therefore z 1 (the first root you come to when 
you turn anticlockwise from the positive real axis) is given by \z { \ = 16 1/4 = y2 and 
arg z l = g X tt/3 = jt/24. 

All of the roots have this same modulus of yfl. 

Also, each of the eight roots is one eighth of a full turn (that is, 2jt/8 or tt/4) from the 
next one. Therefore, by looking at the diagram, it is easy to see what the correct argument 
for each one should be. 

If you want your answers in the a + bj form, you simply use the relation of 
e' 9 = cos# + j sin 0 to do the conversion. 

For example, if you want z 3 , all you need to do is to put k = 2. This gives 



z 3 



j 2 exp 



12jt + Jt 



24 



V 2 ex P7 



13jt 
24 



= V2 cos 



13jt 



j sin 



\3jx 



= -0.185 + 1.402/ to 3 d.p. 



24 / V 24 

(Notice that these numbers look reasonable from the sketch.) 
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exercise io.D.1 Try solving these equations yourself now. For each question, sketch the starting 

number and its roots on an Argand diagram. 

(1) Find the cube roots of 277. 

(2) Find the four solutions of z 4 = 1 + /'. 

(3) Find the three cube roots of \ (3 - 47°) both in mod/arg form, and in a + bj 
form, giving a and b correct to 2 d.p. 

(4) Find the fifth roots of -4 + 4/ in exp form. 

(5) Find the six distinct solutions of z 6 = -4 - 4V3 / in exp form. 

fz-iV 

(6) See if you can solve = -4 

\z + ij 

(This is not as horrible as it looks. Think about what we have already found in this 
section before rushing into binomial expansions.) 



lO.D.(b) Solving quadratic equations with complex coefficients 

Suppose you have an equation like z 2 + (2 + j)z - (3 + j) = 0 to solve. 

Students are sometimes alarmed by the presence of complex coefficients. There is no 
cause for concern, however. The usual quadratic equation formula can be used with an 
equation like this. 

Here, we have a = 1, b = 2 + j and c = -(3 + j) so substituting in the formula gives 

_ _ -(2 +j) ± V(2 +jf + 4(3 +j) _ -(2 + 7) ± JTsTsj 

2 2 

Check that you also get 1 5 + 87' when you tidy up under the square root sign. 
Now we have to find the two square roots of 15 + 87. 

We use the method of Section 10. C. (a) to do this, so we let w 2 = 15 + 87 where w = u 
+ jv. (I have not used the letter z since it is already being used in this example.) We have 

(u + jv) 2 = u 2 - v 2 + 2juv = 15 + 87. 
Equating the real parts gives 

u 2 -v 2 = 15. 
Equating the imaginary parts gives 

4 

2uv = 8 so v = — . 

u 

Therefore 

u 2 - (4/u) 2 = 15 so u 4 - \5u 2 - 16 = 0 so (u 2 - 16)(« 2 + 1) = 0. 

u is real so the only two possible solutions are u = +4 or -4 giving v = +1 
or -1. 

The square roots of 15 + 87 are ±(4 + j). (Notice that this method of solution 
automatically gives us the ± which is included in the quadratic equation formula, and that 
together these roots would make a straight line on an Argand diagram.) 

Now, substituting them back in the quadratic equation formula, we find that the two 
possible solutions for z are given by 

-(2 +j) ± (4 +j) ^ ^ . 
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Notice that these two roots do not form a conjugate pair because the coefficients of the 
original equation were not all real, but they do fit the rules from Section 2.D.(e) that the sum 
of the roots is -bla, and the product of the roots is da. (Check this for yourself.) 

exercise io.D.2 Try solving the following quadratic equations yourself. (You may find that your 

answers to Exercise 10.C.1 are helpful here.) 

(1) z 2 - (3 - 2/)z - (1 + 3/) = o (2) z 2 + (3 - 4/)z - (22 + 67) = o 

(3) z 2 + (2 + j)z - (3 + j) = o (4) z 2 + z - (5 + j) = 0 (5) z 2 - z - 1 + 3/ = o 

(6) z 2 - 2(1 + y)z + 5(1 - if) = 0 (7) z 4 - (1 + /)z 2 + / = o 

lO.D.(c) Solving cubic and quartic equations with complex roots 

How we do this is very similar in many ways to what we did in Section 2.E.(a) in order to 
solve cubic and quartic equations with real roots. You might find it helpful to go back there 
before going on with this section. 

The following two examples will show you these similarities, and also how we use the 
extra information if we are given one complex root. 

example (1) We will solve f(z) = z 3 - 5z 2 + 9z - 5 = 0. 

We can easily find one root here. (Can you spot it?) 



Putting z = 1 gives f(z) = 0 so z = 1 is a root, and this means that (z 
Matching up what we know gives us 

f{z) = z 3 - 5z 2 + 9z - 5 = (z - 1) (z 2 + pz + 5) 

where p stands for the number which we don't yet know. 
Looking at the terms in z 2 , we get 

-5z 2 = -z 2 + pz 2 so p = -4. 

(Check for yourself that this is right by looking at the terms with z.) 
The other two roots of f(z) = 0 will come from solving z 2 - 4z + 5 = 

4 ± V 16 - 20 4 ± 2j 



The three roots of z 3 - 5z 2 + 9z - 5 = 0 are z = 1, and z = 2 ± j. 

example (2) Now we'll solve z 4 + az 3 + bz 2 - 5z - 26 = 0 given that z = 2 + 3/ is 

one root of the equation, and a and b are real. We'll also find the values 
of a and b since examiners often like being given this information. 

Since the coefficients of this equation are all real numbers, any 
complex roots of it must come in conjugate pairs, because otherwise 
multiplying out the factors would give us stray unwanted fs. 

Therefore what else must be a root? 



There must be a second root of z = 2 - 3j. 

Now, in just the same way as we said in the last example that if 
z = 1 is a root of f(z) = z 3 - 5z 2 + 9z - 5 = 0 then (z - 1) is a factor of 
f(z), here we can say that both (z - (2 + 3/')) and (z - (2 - 3/)) are 
factors. 



- 1) is a factor. 



0. They are 
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These look rather alarmingly complicated, but if we multiply them 
together we shall get something very much nicer. We shall have 

(z-(2 + 3j))(z-(2-3j)) = z 2 -(2 + 3j)z-(2-3j)z 

+ (2 + 3/)(2-3y)=2 2 -4z+13. 

The imaginary part has disappeared because the roots are conjugates, so 
making it possible for the original equation to have only real coefficients. 
Now we can say that 

z 4 + az 3 + bz 2 -5z-26 = (z 2 -4z+ 13) (z 2 +pz- 2) 

matching up the first and last terms in the second bracket, since we know 
that when we multiply the two brackets together they must give us z 4 and 
-26. Again,/) is standing for the number which we don't yet know. 

Just as in the first example, we can now match up the terms for the 
various powers of z, since the right-hand side is just another way of writing 
the left-hand side. 

We have already matched the z 4 and number terms. 

Matching the terms in z, we get 

-5z=13/?z + 8z so p = -\. 

Now we have 

z 4 + az 3 + bz 2 - 5z - 26 = (z 2 - 4z + 13)(z 2 -z - 2) 

so we'll match up the terms in z 3 to find a, and in z 2 to find b. Notice here 
that we have to get the arithmetic right first time; we can't use these 
equations as checks because we need them to find new information. 
Matching up the terms with z 3 gives us 

az 3 = -4z 3 -z 3 so a = -5. 

Matching up the terms in z 2 gives us 

bz 2 = 13z 2 -2z 2 + 4z 2 so 6=15. 

(We have to be careful with this one because there are three ways in 
which we can make z 2 on the right-hand side.) 
We now have 

z 4 -5z 3 + 15z 2 -5z-26 = (z 2 -4z+ 13)(z 2 -z-2) = 0. 

To find the other two roots, we now just have to solve the equation 
z 2 -z - 2 = 0. Now z 2 -z - 2 = (z - 2)(z + 1) so the two solutions are z = 2 
and z = -1 . In this particular example, they have both turned out to be real. 

It has been shown by mathematicians that every equation like the one that we started with here 
will have the same number of roots as the highest power of z, if we both allow complex roots, 
and count roots such as x = 1 from (x - l) 2 = 0 as a double root. (Here, the curve is just sitting 
on the x-axis instead of cutting it in two places.) Also, if all the coefficients of an equation are 
real numbers, then the roots must come in conjugate pairs. How does this apply to the two roots 
of z = 2 and z = -1 which we have just found? 



Each of these is its own conjugate because they are real numbers. 
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example (3) Now we will try one together, before I give you an exercise on these. 

Given that a and b are real, and that z = 2 + j is a root of the equation 
z 4 + az 3 - 9z 2 + bz + 30 = 0, find the values of a and & and the other 
three roots. 

(1) What else must be a root? 

Since a and b are real, the conjugate of 2 +j which is 2 -j must also 
be a root. 

(2) From these two roots, what two factors must you have? 

We have the two factors 

(z-(2+j)) and (z-(2-j)). 

(3) When you multiply these two together, what single nice 
factor do you get? 

You get 

z 2 - (2 +f)z - (2 -j)z + (2 +j)(2 -j) = z 2 - 4z + 5. 

(4) Use all this information to write z 4 + az 3 - 9z 2 + bz + 30 
as two factors multiplied together. 

You should have 

z 4 + az 3 - 9z 2 + bz + 30 = (z 2 - 4z + 5)(z 2 + pz + 6) 
where p is standing for the number which we still have to find. 

(5) Now, match up the terms in z 2 to find p. 

You should have 

- 9z 2 = 5z 2 + 6z 2 - Apz 2 so p = 5. 

(Notice the three ways of getting z 2 on the RHS again!) 
We now have 

z 4 + az 3 - 9z 2 + bz + 30 = (z 2 - 4z + 5)(z 2 + 5z + 6). 

(6) Now match up the terms in z 3 and z to find a and b. 

You should have 

az 3 = -4z 3 + 5z 3 so a = 1, 
fe = -24z + 25z so 6=1 also. 

(7) Finally, what are the other two roots of the given equation? 
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We have 

z 2 + 5z + 6 = 0 if (z + 3)(z + 2) = 0 
so the other two roots are z = - 3 and z = - 2. 

It's worth noticing that if you were only asked for the roots then you could have left out 
step (6) altogether. You don't need to know the values of a and b to solve the equation. 

It is also possible to answer the original question by substituting z = 2 +j in the original 
equation, since we know that it fits it. Then we would equate the real and imaginary parts 
to find a and b, and also find the first factor with z 2 as we did above. Then you either use 
long division to find the other factor, or match up the terms as we did above. This method 
involves using binomial expansions to work out (2 + j) 4 and (2 + j) 3 , always a fruitful source 
of arithmetical mistakes. I think that you will find that the method I have shown you is 
easier. 

exercise 10.D.3 Now have a go at solving these equations yourself. 

(1) Given that a and b are real numbers, and that z = i - 2/ is a root of the 
equation z 4 - 3Z 3 + az 2 + bz - 30 = o, find a and b and the other three roots. 

(2) Given that a and b are real numbers, and that z = / is a root of the equation 
z 4 + az 3 + bz 2 - 4Z + 13 = 0, find the values of a and b and the other three 
roots. 

(3) Given that a and b are real numbers, and that z = 1 - / is a root of the 
equation z 4 + az 3 + bz 2 + 1 = 0, find a and b and the other three roots. 

10.E Finding where z can be if it must fit particular rules 

In Chapter 3, when we were working with functions, we often found that we had to restrict 
the choice of possible values for x in order to make the functions work as we wanted. One 
example of this is given by f(x) = y]4 -x. To make f(x) real, we have to have x < 4. This 
means that we have to restrict the possible values of x to just one part of the number line 
which makes up the horizontal axis. 

The same kind of thing can happen with applications of complex numbers. To make 
things work in the way that we want, we may often find that we have to restrict the possible 
values of z. Since z is made up of both an x and y component from its real and imaginary 
parts, this restriction may lead to the exclusion of any area or region of the complex plane 
because it may affect the possible values of both x and y. (The complex plane is the flat 
surface shown on an Argand diagram.) 

Physical quantities which have both magnitude and direction, and which are acting in a 
flat surface, can often be represented very conveniently by complex numbers. Examples of 
such applications are two-dimensional problems involving lines of electric or magnetic 
force, or streamlines in fluid flow. If the physical quantities you are considering need three 
dimensions to describe them, complex numbers will no longer be any use. You would 
probably then use vectors, as I explain in the next chapter. 

The following section is designed to give you more practice in using complex numbers 
and seeing how particular restrictions could be described. 

lO.E.(a) Some simple examples of paths or regions where z must lie 

If complex numbers are described by saying that they obey some rule, we can use this 
information to show the possible points in the complex plane where these numbers may 
lie. 



458 



Complex numbers 



example (1) Suppose we are told that |z| = 3. 

This means that the distance of the point z from the origin is always 
3 units, so therefore z can lie anywhere on the circle shown in the 
Argand diagram in Figure 1 O.E.I. 




We could also write the equation of this circle using xs and ys as we 
did in Section 4.C.(d) by letting z = x + jy and using the relationship \z\ 
= V* 2 + y 2 - Since \z\ = 3, we then have x 2 + y 2 = 3 2 = 9 which is the 
equation of the circle with centre (0,0) and radius 3 units. All the 
possible positions of z lie on the path given by this circle. Such a path 
is sometimes called the locus of z. 

There is a third way of describing this particular circle. We know 
from Section 10.C.(c) that z = e' e gives the unit circle about the origin 
as 0 varies. Therefore z = 3e' e gives the unit circle enlarged by a factor 
of 3, and this is our circle. 

example (2) Suppose we are told that |z| < 2. 

We want the distance of z from the origin to be less than or equal to 
2 units, so it must lie either on or inside the circle shown in Figure 
10.E.2 below. This time the possible positions of z make a region in the 
plane, rather than a path given by a line as in Example (1). (You may 
find it helpful to use your own colour on these diagrams to highlight the 
different possible positions of z.) 

This region can be described as either |z| < 2 or as x 2 + y 2 < 2 2 
which is the same as x 2 + y 2 < 4. 
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example (3) Suppose we are told that arg z = jt/3. 



The argument of these complex numbers is fixed but their modulus 
can take any value. This means that z can lie anywhere on the straight 
line shown below in Figure 10.E.3. 



Aim 




<Xf<j r - 



Figure 10.E.3 



EXERCISE 10.E.1 



The arrow on the end of the line which represents the possible values of 
z shows that it can be extended indefinitely. Such a line is sometimes called 
a half-line because it can only be infinitely extended in one direction. 
Extending it in the opposite direction would include points for which arg z 
= -lull, and this doesn't fit the given condition. 

Use separate Argand diagrams to show the possible positions for z for each of the 
following. 

(1) |z| = 4 (2) |z|<i (3) argz = -7t/6 (4) argz = o (5) \z\>i 
(When the boundary of a region is not included, show it with a dashed line.) 

lO.E.(b) What do we do if z has been shifted? 

Each of the examples which we have looked at so far has been related directly to the origin. 
What will happen to the path if the information that we are given concerns a complex number 
which has been shifted away from there? 

example (1) Suppose we are told that |z - 5| = 2. Where is the path which describes the 
possible positions of z now? 

If we let z - 5 = w then we know that the path for w is the circle about 
the origin with a radius of 2 units. But what do we have to do to z to get wl 



We have to subtract 5 from z which means that we are taking 5 away 
from the real part of z since 5 itself is real. Therefore we are shifting z by 5 
units to the left to get w. So the circle giving the path for z must be 5 units 
to the right of the origin. I show both these circles on the pair of Argand 
diagrams in Figure 10.E.4. 

We can check that the path of z is in the right place by putting z = 7. 
Then |7 - 5| is indeed equal to 2, and we see from the diagram that z = 7 is a 
point on the path of z. The equation of this circle in terms of x and y is 
given by (x- 5) 2 +y 2 = 2 2 orx 2 - lOx + y 2 + 21 = 0 because its centre is at 
(5, 0) and its radius is 2 units. (If you need help with this, you should go to 
Section 4.C.(d).) 
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Figure 10.E.4 



example (2) What would the path of z be if we are told that |z - 3j\ = 1? 



Try drawing a sketch for yourself of where you think it would be. 
Check your sketch by using z = 4j as a test point, and write down 
the equation of the path of z in terms of x and y. 



If we put w = z - 3j then the path of w is the circle about the origin 
with a radius of one unit since \ w\ = I. 

To get w from z, we have to subtract 3 from the imaginary part of z. 

Therefore the path of z is the circle whose radius is one unit and 
whose centre is at the point (0,3)- That is, the centre of the circle is at 
the point which represents the complex number 3j. I show the w circle 
and the z circle in the pair of Argand diagrams in Figure 10.E.5. 

The equation of this circle can also be written as x 2 + (y - 3) 2 = l 2 
or x 2 +y 2 -6y + S = 0. 
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Figure 10.E.5 
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example (3) The general case of \z - (a + bj)\ = k. 



We can now see what the path of possible points for z will be if we 
are told that \z - (a + bj)\ = k, where a + bj is any given complex 
number and k is any given real number. 

This is because if we put w = z - (a + bj) then the path of w is the 
circle about the origin with a radius of k units. To get to w from z, we 
have to subtract a units from the real part of z and b units from the 
imaginary part of z. 

Therefore the path of z is the circle whose radius is k units and 
whose centre is at the point (a, b), that is, at the point which represents 
the complex number a + bj. If z = x + jy then the equation of this circle 
can be written as (x - a) 2 + (y - b) 2 = k 2 . 

example (4) Suppose we are told that arg (z + 1) = 7t/3. 

This time, we want to find the path of z from knowing what its 
argument is after it has been shifted. 

If we let z + 1 = w then we can immediately sketch the position of w 
on an Argand diagram. I have done this in Figure 10.E.6(a). 




argand diagram for W Organ** dt"agrt*rr> for 
Figure 10.E.6 



Since z has been shifted to the right by one unit to give w, we can 
now draw in the path of z on its Argand diagram. I show this in Figure 
10.E.6(b). 

exercise 10.E.2 Show the paths giving the possible positions of z from the information given in 

each of the following questions. (I have included one rogue question which is 
impossible. Which one is it?) 

(1) |z - 3| = 1 (2) \z - /| = 2 (3) |z + 4| = 3 (4) |z + 2/1 = 3 

(5) |z - i| = -2 (6) |z - (2 + 3/)| = 4 (7) |z + 3 - 2/| = 3 

(8) arg(z - 2) = jt/6 (9) arg(z + j) = tt/4 (10) arg(z + 3) = - 71/3 

lO.E.(c) Using algebra to find where z can be 

It is possible to use algebra to find where z must be in the complex plane if it has to fit 
certain conditions. If the geometry of what is happening gets more complicated then it is 
often easier to take this approach. 

462 Complex numbers 



example (1) I will start by showing you how we could have used algebra to solve the 
first example that we looked at in Section lO.E.(b), so that you can see 
how we get the same answer. 

We were told that |z - 5\ = 2 and we had to find the path which z 
could take in order to fit this condition. 

We start by putting z = x + jy which gives us \x + jy - 5\ = 2. 

Next, we tidy up the x + jy - 5 to show the real and imaginary parts 
separately and clearly. Doing this gives us 

|(x-5)+y>| = 2. 
Therefore we can say that 

V(x-5) 2 +j 2 = 2 
using the rule that \a + bj\ 



(1) 

= \ja 2 + b 2 from Section lO.A.(c). 



^^^^ Don't be tempted to think this should be \ (x - 5) 2 - y 2 = 2. 

^B-^V Remember that the j is telling you that the y is measured vertically. We then 

use Pythagoras' Theorem on the right-angled triangle shown in the sketch in 

Figure 10.E.7 to get |w| where w = (x - 5) + jy. 




lw|= 2 

el w= (x-5") + j'lj 



Qcjand diagram /or w 
Figure 10.E.7 



Squaring both sides of equation (1) above gives us (x - 5) 2 +y 2 = 2 2 = 4. 

This is the equation of the circle with centre (5,0) and radius 2, 
which agrees exactly with what we obtained at the beginning of Section 
10.E.(b) for the path of z by using the geometry of the diagram. 

example (2) My second example involves a more complicated situation where it is 
definitely easier to use algebra. 



We will find where z can be in the complex plane if 



z + 2 



z+7 



< 2. 
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From the rule for dividing complex numbers, we know that 
hi 
hi 



Z2 



so here we have 



\z + 2\ 



which means that 

U+A ~ 

This gives us 

|z + 2| < 2 \z 

It is possible to do this last step because we know that \z + j\ is positive 
since it is a length. Therefore, multiplying both sides of the inequality 
by it does not change the < sign to a > sign. (If k is a positive number, 
and a < b, then ka < kb also.) 

Next, we put z = x + jy and then carefully tidy up to show the real 
and imaginary parts separately and clearly. 

Doing this is very important! 



We get 

|x+y> + 2|<2 \x+jy+j\ 
so \(x + 2)+jy\<2\x+j(y+l)\ 
so ^{x + 2f+y 2 < 2 Vx 2 + 0 + l) 2 . 

Since both sides are positive, the inequality remains true in the same sense 
if we square them both. (If they weren't both positive this would not 
necessarily be so. For example, -3 < 2 but (-3) 2 > (2) 2 .) 
Squaring both sides gives us 

(x + 2) 2 + j 2 < 4(x 2 + (y + l) 2 ). 



Don't forget to square the 2! 



Now we have 

x 2 + 4x + 4 + y 2 < 4x 2 + 4_y 2 + 8>> + 4 
so 0 < 3x 2 - 4x + 3j 2 + 8y or 3x 2 - 4x + 3j 2 + Sy > 0. 
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We then use the method of completing the squares to get the centre and 
radius of the boundary circle. (See Sections 2.D.(b) and 4.C.(d) if 
necessary.) It is easier to complete the squares here if we divide all through 
by 3 to get x 2 and y 2 . 

Doing this gives 

x 2 - \x + y 2 + \y>Q 
so (x - 3> z - 9 + 0 + if - - > 0 or (x - 5) z + 0 + j) ^ T- 

The boundary of the region where z can lie is the circle whose centre is 

2 4 1 / — 

(3,-3) and whose radius is ^20/9 = (2 y5)/3. (If you have any trouble with 
this step, you should go back to Section l.F.(c) for help.) 

Since the form of the inequality tells us that the distance of the point 
representing z from the centre of this circle is greater than or equal to its 
radius, the region where z may lie is the circumference of the circle and 
everywhere outside it. 

I show this in Figure 10.E.8. The centre of the circle is given by the 

2 4 

complex number 3-37. 




example (3) Find the region in the complex plane in which z can lie if 

Im I— ^— I > 2. 

\z + 4/7 



First, we put z = x + jy, giving us 

/ x + jy \ 
Im >2. 

\x + j(y + 4)/ 

Next, we tidy up the fraction 

x +jy 

x+j(y + 4) 

so that we can easily see what its imaginary part is. 



10.E Finding where z can be 



We do this tidying up by multiplying its top and bottom by the 
conjugate of the bottom which gives 

(x + jy) (x -j{y + 4)) _ x 2 + yiy + 4) + j(xy - xy - Ax) 

(x +j(y + 4)) (x -jiy + 4)) x 2 + (y + 4) 2 

-4x 

so Im 
Now we put 



z + 4/7 x 2 + (y + 4) 2 



-4x 

> 2 



x 2 + (y + 4) 2 

to see where z can lie. This gives us 

-4x > 2x 2 + 2(y + 4) 2 . 

(We can do the above multiplication because we know that x 2 + (y + 4) 2 
is positive since x and y are both real.) 
Dividing by 2 gives us 

-2x > x 2 + (y + 4) 2 so x 2 + 2x + (y + 4) 2 < 0 

so (x+ l) 2 + (y + 4) 2 < 1. 

This means that z can lie anywhere on or inside the circle whose centre 
is at the point -1-4/ and whose radius is one unit. Sketch this for 
yourself. 



exercise 10.E.3 Now try these questions for yourself. 

(l) Find the path on which z can lie if 

(a) |z-i| = |z + i|, (b) 



z + 1 



2Z-J 



= 1. 



(2) Find and sketch the regions where z can lie in the complex plane for each of 
the following given conditions. 

(a) \z\<\z-j\ (b) |z|< V2IZ-/I (C) 2|Z|<|2Z-/| 



(3) Solve the equation Re I \ = \ + jz. 

\z + ij 

lO.E.(d) Another example involving a relationship between w and z 

In the examples of Section lO.E.(b) we saw how we could write rules given for a variable 
number z so that they were in terms of a new variable number w instead. I shall finish 
Section 10.E by looking at an example where we are given a relationship between z and w 
which isn't just a simple shift. We shall then use this to find out how making z obey a special 
rule will affect w. 

We will start with the relationship 

w = 

with w = u +jv and z = x + jy. 
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We'll now answer some questions about how this relationship works. 
(1) Can you see a value of z which we must exclude? 



(1) We can't have z = j since this would give us a zero on the bottom of the 
fraction. 

(2) Each value of z will give a corresponding value of w and vice versa. 

What will w be if (a) z = 1 + j, (b) z = 3 + j, (c) z = 3 -p. 
Try answering this yourself, giving your answers in as simple a form as you can 



(2) You should get 

3 + 2j 2 3 

(a) w = 1 + 2j (b) w = = 1 + u' (c) w = . 

3 3-2/ 

Simplifying (c) by multiplying top and bottom by the conjugate 3 + 2j gives 

/ 3 W3 + 2A 9 + 6/ , 6 . 

(3) What are the real and imaginary parts of w in terms of the real and imaginary parts 
of z? 

Putting w = u +jv and z = x + jy gives 

(x+jy)+j x+j(y+l) 

w = u + jv = = . 

(x+jy)-j x+j(y-\) 

Now we simplify this fraction by multiplying its top and bottom by the complex 
conjugate of the bottom. Try doing this for yourself. 



(3) You should have 

+ _ = ( x+j(y+ 1) \ f x -j(y - 1) \ = x 2 + y 2 - 1 + 2jx 
U+JV \x+j(y-\))\x-j(y-l)) x 2 + (y-l) 2 

(4) Now see if you can write down what u and v are in terms of x and y. 



(4) Equating the real and imaginary parts from above should give you 

x 2 + y 2 - 1 2x 
u = and v = 



x 2 + (y - l) 2 x 2 + (y - l) 2 

(5) If we now make the special condition that the real and imaginary parts of w are 
equal, so that w lies on the straight line u = v, what path will z lie on? 

If u = v it must be true that x 2 + y 2 - 1 = 2x since the bottoms of the fractions 
are the same. 

This gives us x 2 - 2x + y 2 - 1 = 0 so (x - l) 2 + y 2 = 2. 



Therefore z lies on a circle whose centre is at (1,0) and whose radius is J2. 
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(5) 



The easiest way to show what is happening geometrically is to draw two separate 
Argand diagrams for each of w and z. 
Doing this gives us Figure 10.E.9 



4"- Im 




Qrejand d t ag Cam for W= u+jv 
with u = v 




Grgand diagram showing 
the. Corresponding pafcK _for z 



Figure 10.E.9 



The little circle on the imaginary axis at +1 in (b) is to show that z = j isn't 
allowed. 

(6) We will finish off this example by finding out how a few points will transfer from 
one Argand diagram to the other. (Notice that none of the points which we found 
in (1) will help us here as u and v are not equal for any of them.) 

We will look first at how two points will move from (b) to (a). 

What is w when (i) z l = -j and (ii) z 2 = 2 + /? 



(6) (i) Ifzj =-j then w x = 0. 
(ii) If z 2 = 2 + j then 

2 + 2j 

W 2 = =1+7. 

2 

Mark these two pairs of points on the Argand diagrams. (You can see that 
z 2 = 2 + j really is on the circle of (b) if you put x = 2 and y = 1 in its 
equation.) 

Now we'll go the other way. It will be easier to do the working out if we start 
by finding out how we can write z in terms of w. We have 

Z+j ■ +■ 

w = SO WZ - WJ = Z +J 

jiyv + 1) 

so z(w - 1) =j(yv + 1) so z = . 

w - 1 

We only want points for which u = v. 

What is z if (iii) w 3 = 2 + ii an ^ w 4 = -\ - j? 
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..... « ii. . 7 (I + \j) 3/ ~ 1 
(in) If w 3 = J + 57 we get z 3 = — j j — = — . 

"2 + 27 7 -1 
Tidying up by multiplying top and bottom by j + 1 , we get 

3/-l\//+l\ -4 + 2/ 



2 



= 2-y = z 3 . 



(iv) w 4 = -l-7 gives z 4 = — = =-5 + s7- 

-2-y \2+j)\2-j) 



Mark these two pairs of points on the two Argand diagrams of Figure 
10.E.9. as well. 

One important application of relationships like this is in the description of 
electric circuits by using complex numbers. Working as we have done above could 
make it possible to find out what happens to, say, the output impedance of a circuit 
from a given variation in the input impedance. 



exercise 10.E.4 Try this example for yourself now. 

If w = u + jv and z = x + j'y and w and z are related by the equation 

z + j 

w = , 

z - 1 

answer the following questions. 

(1) What value of z must we exclude? 

(2) What is w if (a) z = 2 and (b) z = 1 - /? 

(3) Find the real and imaginary parts of w in terms of the real and imaginary 
parts of z. 

(4) If the real and imaginary parts of w are equal, find the path on which z must 
lie. 

(5) Show the paths of z and w on two separate Argand diagrams. 

(6) Find w if (a) z a = -/ (b) z 2 = 2 - / (c) z 3 = 1 - 2/ (d) z 4 = § - \j 
(e) z 5 = \ - \j (f ) z 6 = § - \j. 

Mark each of these pairs of points on your two Argand diagrams. 

(7) If v = 0, so that w lies on the real axis, where must z lie? 

(8) If u = -v what is the path on which z must lie? 

(9) Sketch two diagrams showing the three paths of w from (4), (7) and (8) on 
the first diagram and the corresponding three paths of z from (4), (7) and (8) 
on the second diagram. ... _ 



(10) The three paths of w cut each other at the 
origin. 

The three corresponding paths of z cut each 
other at z = -/'. 

Compare the angles between the paths at 
these two points on your two sketches. 
What do you find? 
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Working with vectors 



In the last chapter, we used both magnitude and direction to represent complex 
numbers in Argand diagrams. Now, we shall extend this idea to discover how we 
can handle physical quantities which have both magnitude and direction, working in 
both two and three dimensions. 

The chapter is divided into the following sections. 

11. A Basic rules for handling vectors 

(a) What are vectors? 

(b) Adding vectors and what this can mean physically, 

(c) Using components to describe vectors, 

(d) Vector components in three-dimensional space, 

(e) Finding the magnitude of a three-dimensional vector, 

(f) Finding unit vectors 

11. B Multiplying vectors 

(a) Defining the scalar or dot product of two vectors, 

(b) Working out the dot product of two vectors, 

(c) Defining the vector or cross product of two vectors, 

(d) Working out the cross product of two vectors, 

(e) Can we multiply three vectors together by using dot or cross products? 

(f) The vector triple product, 

(g) The scalar triple product and what it means geometrically 

ll.C Finding equations for lines and planes 

(a) Finding a vector equation for a line, 

(b) Dealing with lines in two dimensions, 

(c) Dealing with lines in three dimensions, 

(d) Finding the Cartesian equation of a line in three dimensions, 

(e) Another form for the vector equation of a line, 

(f) Finding vector equations for planes, 

(g) Finding equations of planes using normal vectors, 

(h) Finding the perpendicular distance from the origin to a plane, 

(i) The Cartesian form of the equation of a plane, 
(j) Finding where a line intersects a plane, 

(k) Finding the line of intersection of two planes 

11. D Finding angles and distances involving lines and planes 

(a) Finding the angle between two lines, 

(b) Finding the angle between two planes, 

(c) Finding the acute angle between a line and a plane, 

(d) Finding the shortest distance from a point to a line, 

(e) Finding the shortest distance from a point to a plane, 

(f) Finding the shortest distance between two skew lines 



11. A Basic rules for handling vectors 

ll.A.(a) What are vectors? 

Some physical properties, such as temperature or area, are given completely by their 
magnitude and so only need a single number to represent them. Such quantities are called 
scalars. But there are other physical quantities, such as force, velocity or acceleration, for 
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which we must know direction as well as size or magnitude in order to work with them. It 
is often very helpful to represent such quantities by directed lines. Such directed lines are 
called vectors. Because vectors carry the physical information of both magnitude and 
direction, using them gives us a very neat way of handling these quantities. 

It is important to avoid any confusion between vectors and scalars so we must make 
the distinction clear when we write them. Scalars are written with no special marking so 
we write 3 or 16 or A:, say. If a vector runs from O to A, we can write it as OA or print 
it as a single boldface letter, say a. Since we can't do boldface in handwriting, we would 
show that this is a vector by underlining it and writing it as a. I have done this in my 
diagrams. 

Figure ll.A.l illustrates some properties of vectors. 




• Two vectors are equal if and only if they are equal in both magnitude and direction. So 
although a and b are the same length they are not equal because they are in different 
directions. However, the two vectors marked c are defined to be equal although we need 
a shift to move the directed lines exactly on top of each other. 

• Multiplying a vector by a positive number or scalar just has the effect of changing its scale 
or magnitude. So, for example, the vector 2a would be twice as long as a but in the same 
direction as a. 

• Multiplying a vector by a negative number reverses its direction as well. I have shown 
-1 x c = -c in my diagram. 

• The zero vector is the vector which has zero magnitude. Therefore it has no direction and 
so there is only one zero vector. 

• The vector r in the drawing above is called the position vector of the point P from the 
fixed origin O, and describes the displacement of P from O. 



ll.A.(b) Adding vectors and what this can mean physically 

Vectors can have different directions from each other so how can we make sense of adding 
them? Since we can think of vectors as displacements or journeys, to add two vectors we just 
need to find the single displacement which gives the same result as doing the two 
displacements separately. The left-hand drawing in Figure 11.A.2 shows that the single 
vector of p + q is given by doing p followed by q and also by doing q followed by p. Each 
of the two joined together triangles shows what is called the triangle law of addition. 

So, for example, if a boat sets a course to move with velocity p in water flowing with 
velocity q, then it will actually have a resultant velocity of p + q. The third side of the 
triangle gives the speed and direction in which it will move. 
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Figure 11.A.2 



The right-hand drawing could represent two forces P and Q acting on a particle. Their 
joint resultant effect is given by the vector P + Q which is the diagonal of the parallelogram. 
This picture shows what is called the parallelogram law for the addition of forces which 
is simply another way of looking at the triangle law of addition. 

Figure 11.A.3 shows another useful result. 



? 




Figure 11.A.3 



We have the two points P and Q whose position vectors from the origin, O, are p and q. 
The vector PQ gives the displacement from P to Q but we can also move from P to Q by 
doing PO followed by OQ. This gives us the result 



PQ = -p + q 



We can show the addition of any number of vectors by putting them nose to tail and 
seeing what the final displacement is. Figure 1 1 .A.4 shows that the successive displacements 
of a, b, c, d and e are equivalent to the single displacement of r. It demonstrates the vector 
sum a + b + c + d + e = r. 
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The addition works in exactly the same way if the vectors shown are being placed nose to 
tail in three dimensions. You can add the vectors in any order you choose. The result is the 
same. If the vectors join up so that there is no gap between the tail of the first vector and the 
head of the last one then their vector sum is zero. So, for example, if the vectors a, b, c, d, e and 
-r were representing forces acting on a particle then there would be no net force acting on it. 

We now know how to add vectors but how would we subtract them? If we have the vector 
c then -c is defined to be the vector having the same magnitude but the reverse direction 
to c. Subtracting c is the same as adding -c. 

Here are three examples in which we use the rule for adding vectors. 

example (1) A man needs to swim across a river to a point on the far bank 



immediately opposite to him. His maximum speed in still water is 
1 m s _1 and the current is flowing at 0.5 m s~ l . At what angle to the 
near bank should he strike out in order to achieve his aim? If he can 
maintain his best speed, and if the width of the river is 25 m, how long 
will it take him to swim across? 

We start by drawing a sketch. This is always a very important first 
step to understand what is happening. 



Figure 11.A.5 

Figure 11. A. 5 shows how the swimming speed in still water and the 
current need to combine to give the required resultant velocity. It would 
not work for the swimmer to strike out directly for the opposite bank as 
the current would push him downstream. If v stands for the speed at 
which he is swimming, then using Pythagoras' Theorem on the triangle, 
we have 1 = 0.25 + v 2 so v = 0.866 to 3 s.f. 
His angle 0 with the near bank is given by 0 = cos -1 (1/2) = 60°. 
The time taken for him to swim across the river is (25/0.866) = 28.9 s 
to 1 d.p. 



Three forces, P, Q and R, act on a particle so that it is in equilibrium. 
The angle between the lines of action of P and Q is r, the angle 
between Q and R is p and the angle between R and P is q. How are the 
magnitudes of the forces related to these angles? 

I will use the letters P, Q and R to stand for the magnitudes of each 
of the three forces. By their magnitude I mean their size only. Since the 
particle is in equilibrium, the vector sum of P, Q and R is zero, and 
so the three vectors can be represented by the three directed sides of 
a triangle with lengths of P, Q and R. I show this in Figure 11.A.6. 
P followed by Q followed by R brings you back to where you started 
from, giving zero displacement. 




example (2) Lami's Theorem 
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Figure 11.A.6 



The angles between the lines of action of the forces are the three 
exterior angles of the triangle. So now we can use the Sine Rule on this 
triangle (see Section 4.B.(a) if necessary). This gives us 

P _ Q _ R 

sin(180°-/7) sin(180°-g) sin(180°-r)' 
But, for any angled, sin(180° -A) = sin A, so we have 



p 


Q 


R 


sin p 


sin q 


sin r 



This useful result is called Lami's Theorem. 

example (3) The ratio theorem 

Suppose you have two points P and Q with position vectors p and q 
with respect to an origin, O. Suppose there is a point K lying on the line 
PQ, and between P and Q, so that it divides PQ in the ratio PK : KQ = 
I : m. We say that K divides the line PQ internally. We'll now find the 
position vector k of K in terms of p and q. 

I've shown a sketch of what we are told in Figure 1 l.A.7. 



? 




Figure 11.A.7 
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We have k = p + PK but PK = 1/(1 + m) PQ and PQ = -p + q. So 

(/ + m)p + /(-p + q) mp + Iq 

k = = . 

I + m I + m 



If the point K divides the line PQ internally in the ratio / : m and 
the position vectors of P, Q and K with respect to O are p, q and 
k, then 

mp + Iq 

k = — . 

/ + m 



Notice that / and m get flipped over so that / goes with q and m goes with 
p. This is the same as what happens with p and q in the formula for the 
coordinates of a point which divides a line in the ratio p : q in Section 
2.B.(i). The main difference is that this working applies equally well to 
three-dimensional vectors. 

A useful particular result from the above formula is that, if K is the 
midpoint of PQ, we get k = j(p + q). 

Try these questions for yourself now. 

exercise 11.A.1 (1) A particle is in equilibrium under the action of three forces of 7N, 10N and 13N. 

What is the angle between the lines of action of the two largest forces? 

(2) In triangle ABC, the sides of the triangle are given as vectors by AB = p, BC = 
q and CA = r. D, E and F are the midpoints of BC, CA and AB respectively. A 
straight line joining the vertex of a triangle to the midpoint of the opposite 
side is called a median. The medians of this triangle, also given as vectors, 
are AD, BE and CF. Find AD + BE + CF in terms of p, q and r and simplify your 
answer as far as possible. 

(3) The position vectors from the origin of A, B and C in triangle ABC are a, b and 
c respectively. BM is a median of the triangle. Find the position vector g of the 
point 6 which is 2/3 of the way down BM from 6. What does your answer tell 
you about the three medians of the triangle? 

(4) Just before this exercise, we found a formula for the position vector of a point 
which divides a line internally in the ratio / : m. Now suppose that instead the 
point K divides the line PQ so that PK/KM = l/m with K lying on the other 
side of Q from P. We say in this case that K divides the line PQ externally. 
Since we are changing direction as we move from P to K and back from K to 
Q, we say that the ratio is / : -m. Show that, with this adjustment in sign, the 
formula for the position vector of K is unchanged. 

It would be very complicated if any problem involving vectors could only be solved by 
drawing a diagram and then working out its geometry. We now make vectors much easier to 
handle by finding out how to write them in component form. 
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ll.A.(c) Using components to describe vectors 

We want a non-visual way of working with vectors so we don't have to draw them to work 
with them. We'll first think how we could do this if all the vectors lie in the same plane 
so that we are in two-dimensional space. If we could describe each of these vectors in 
terms of some chosen reference vectors, then we could work out how they affect each 
other by using algebra. For example, in Figure 11. A. 8, I have chosen g and h as my two 
reference vectors. Using them, we can build up a grid which makes it possible to describe 
the position vectors with respect to the origin of any points in the plane as multiples of 
g and h. 




Figure 11.A.8 



You can see that the position vectors from O to the points P, Q and R are described 
by 

p = g + 3h, q = (5/2)g + h and r = -g + 2h. 
As an example of working with these vectors, we have 

q + r = [(5/2)g + h)] + (- g + 2h) = (3/2)g + 3h. 

Check that this will work geometrically on the grid. 

We can describe any point in the plane using multiples of g and h. In other words, if we 
choose any point P with position vector r then we can write r = ag + bh where a and b are 
numbers whose value depends on the particular position of P. This includes the possibility 
of irrational numbers so, for example, we could have r = V3g + 2h. 

You can choose any pair of vectors to describe the positions of all the points in this plane 
in this way just so long as your two chosen vectors aren't parallel to each other. Because 
neither g nor h can be written in terms of the other, mathematicians call them linearly 
independent. Any two non-parallel vectors in the plane will be linearly independent. 

In practice, there is a huge advantage in choosing two vectors that are perpendicular to 
each other, and making each of them one unit in length. (Vectors with unit length are called 
unit vectors.) We call these i and j, with i running along the x-axis and j running up the 
j-axis. 

Then we can write the position vector r of any point P as r = a\ + bj. The particular 
numerical values given to a and b describe where P is. 

So, for example, in Figure 1 1.A.9 we have p = 3i + 2j, q = 2i - j and r = - i - j. 
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Figure 11.A.9 

Sometimes it is useful in physical problems to be able to split a vector up into two 
components. For example, we might want to split a force into its horizontal and vertical 
components. Or, given a velocity V with known magnitude and direction, you might be 
asked to find X and Y where X and Y are the components of V due east and due north 
respectively. I have shown this in Figure 11. A. 10. 



Figure 11.A.10 

We have V = X + Y as a vector sum. V, the magnitude or numerical size of V, is 
represented by the length of the hypotenuse of the triangle. 

If we know V and the size of the angle 9, how can we use these to find X and Y, the 
magnitudes of X and Y? 

To do this, we use the trigonometry of right-angled triangles. (See Section 4.A.(a) if 
necessary.) 

We have sin 9 = YIV soY=V sin 9 and cos 9 = XIV so V cos 9 = X. 
If we know both V and 9, we can now find X and Y. 

Similarly, if we are asked this question the other way round, so that we need to find the 
magnitude and direction of V from its components X and Y, we can do this in two steps. 
Firstly, we can find the magnitude V of V by using the known magnitudes X of X and 



Now, by Pythagoras' Theorem, we have V 2 = X 2 + Y 2 so V = \]X 2 + Y 2 . This tells us V, 
the magnitude of V. This magnitude can also be written as |V|. 

Secondly, we can say that tan 9 = YIX so 9 = tan -1 YIX and this gives us the direction 
of V. 

Try these questions now. 




7 of Y. 
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exercise 11.A.2 (1) A ship is sailing on a course of N 62 0 E at a speed of 42 knots. (Because we 

know the direction as well as the magnitude of its speed, we know its 
velocity.) What is the component of its velocity due east? What is the 
component of its velocity due north? Draw a sketch showing the course of 
the ship and its vector components. 

(2) A particle is acted upon by the three forces P = 21 — 3j, Q = 41 + j and S = 
-3i + 2j. What is the resultant force R which acts on the particle? Draw a 
sketch showing P, Q, S and R. 

(3) If f = 2i + j and g = i + 4j and h = 4! + 2), is it possible to find multiples 
of the pairs of vectors (a) f and g, (b) g and h, and (c) f and h to write 
down the position vector relative to the origin of any point in 
two-dimensional space? If it is possible, describe in each case how you 
would combine that pair of vectors to write the vector r = zji + 9]. If it is not 
possible, why is this? 



ll.A.(d) Vector components in three-dimensional space 

We saw in the previous section how we could use two vectors to form a two-dimensional grid 
and then find a combination of multiples of these two vectors to describe the position vector 
of any point relative to the origin. Similarly, if we are working in three dimensions, we start 
by choosing any three vectors which can be rearranged to form a three-dimensional grid. We 
can then describe the position vector of any point from the origin by taking a combination 
of multiples of these three vectors. 

For working in two dimensions, we found it most convenient to use the pair of unit 
vectors i and j running along the x- and j^-axes respectively. When we move into three 
dimensions, we add a third unit vector k which runs along a z-axis which is perpendicular 
to both the x- and y-axes. It is usually most convenient to have the z-axis running vertically 
upwards. I show this in Figure ll.A.ll. 




Now we can write the position vector r of any point P in space in the form r = ai + b\ 
+ ck with particular numerical values of a, b and c for each chosen r. 
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We could have drawn the z-axis in two ways to make it be perpendicular to the 
x- and y-axes. In physical applications, we always choose the direction of the 
z-axis so that x, y and z fit to the thumb, first finger and second finger of the 
right hand. We then get what's called a right-handed coordinate system. You'll 
see that my axes fit this rule and that your left hand would give a mirror image. 

Each of these axes can be extended in its negative direction also, so that we 
can represent vectors such as r = -3i - 2j or r = 2i - j - 5k. 



Adding vectors is extremely easy if you know their components. All you have to do is to 
find the total for each separate component. 

For example, suppose you have two forces P and Q with P = 3i + j -4k and Q = 2i -4j 
+ 2k. Then their resultant R = P + Q = 5i -3j - 2k. 



ll.A.(e) Finding the magnitude of a three-dimensional vector 

If we have a vector in component form, we may need to find its length or magnitude. Figure 
1 1.A.12 shows you how we would do this for the vector OP = r = a\ + b\ + ck. We again 
write the length or magnitude of r as |r|. 



A* 




Now, working with the lengths OA, OB and OC we have OA = a, OB = b and OC = c. 
Also, PQ = OC so PQ = c. 

In the right-angled triangle OQA, we have OQ 2 = a 2 + b 2 and in the right-angled triangle 
OPQ, we have OP 2 = OQ 2 + PQ 2 . So OP 2 = a 2 + b 2 + c 2 . The formula for the length OP 
or |r| is the three-dimensional form of Pythagoras' Theorem. 

If r = ai + bj + ck then |r| = ^a 2 + b 2 + c 2 . 



The direction of OP is given by the three angles which OP makes with the x-, y- and 
z-axes. We'll call these three angles a, ji and y. 

OP makes APOA = a with the x-axis and we have 

a a 

cos APOA = cos a = = , 

OP ^a 2 + b 2 + c 2 
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Similarly, we have 



b 

cos APOB = cos /? = = 

^a 2 + b 2 + c 2 

and 

c 

cos L-POC = cos y = . 

Va 2 + 6 2 + c 2 

These are called the direction cosines of OP. 

Notice that the vector cos a i + cos j3 j + cos y k is in the same direction as the vector 
OP, but scaled down by a factor of the length of OP. Therefore it is a unit vector in the 
direction of p and we write it as p. 

Here's a practical application which uses components and magnitude. 
If a body is acted on by three forces P, Q and S, find the resultant force R, and its 
magnitude |R| if P = 3i - 5j + 2k, Q = -2i + 4j + 4k and S = 3i - 3j - 4k. 
We have 

P + Q + S = R = (3 - 2 + 3)i + (- 5 + 4 - 3)j + (2 + 4 - 4)k = 4i - 4j + 2k 

and 

|R| = V(16 + 16 + 4) = 6. 

It is sometimes convenient to write vectors in column vector form. So, for example, we 
would write i, j and k in the form 




More rarely, we use the row vector form. This would give us 
i = (1, 0, 0), j = (0, 1, 0) and k = (0, 0, 1). 

ll.A.(f) Finding unit vectors 

Sometimes, such as when we work with planes in Section ll.C.(h), we need to find a unit 
vector in the same direction as a given vector, u. We call this unit vector fl. 

Here is a numerical example of how the working out goes. Suppose you need the unit 
vector in the same direction as u = 2i - j + 2k. 

The length or magnitude of u is given by ^/ (4 + 1 + 4) = 3. Also, the required vector must 
be parallel to u. So we can get the vector we want by just scaling down u by a factor of 3. 
It is u = J(2i - j + 2k). 

Remember that each of the vectors i, j and k has unit length and so they are unit vectors. 

Now try these questions yourself. 

exercise 11.A.3 (l) What is the magnitude of the vector v = 4i + 3j'? Find a unit vector v which is 

in the same direction as v. Repeat this question for w = 8i - is]. 
(2) Find the length of the vector p = 4i + 3] + 12k. Draw a little sketch like Figure 
11.A.12 so that you can see the three-dimensional Pythagoras' Theorem 
working in this case. Write down a unit vector in the same direction as p. 



480 



Working with vectors 



(3) If the three forces P = 41 + 3j + 7k, Q = 2i + 9j - 7k and R = 41 - 71 + k act on 
a particle, what fourth force is required so that the particle is in equilibrium? 
What is its magnitude? 

(4) Working in two dimensions, write down two non-zero vectors p and q whose 
sum is zero. What is the physical meaning of p + q if p and q are (a) 
displacements (b) forces? 

(5) Find and write down three non-parallel vectors p, q and r in three dimensions 
whose vector sum is zero. Could you use combinations of your three chosen 
vectors to write down the position vector of any point in three-dimensional 
space? If not, why not? 

11. B Multiplying vectors 

In the previous section we found out how we could add vectors by thinking of their sum as 
the single displacement which is equivalent to putting together their successive displace- 
ments. Can we define a way to multiply two vectors together so that we get a useful result 
while taking account of the directions of the vectors as well as their magnitudes? It turns out 
that there are two different ways of doing this and this section describes them both. 

ll.B.(a) Defining the scalar or dot product of two vectors 

This method for multiplying vectors is called the dot product because we write the two 
vectors with a dot between them. Figure 1 l.B.l shows how we define the product a • b of 
the two vectors a and b and also gives a very neat practical application. 




Figure ll.B.l 

The man is pulling the block with a constant force a so that it moves along the horizontal 
ground. The work done in moving the block through a distance b is then given by the 
distance moved multiplied by the magnitude of the component of the force in the direction 
of motion. 

This is |a||b| cos 9 so we define the scalar or dot product as 



a • b = |a||b| cos 6 



where 6 is the angle between a and b when they are placed tail to tail. 

To use the least possible force, we would need to pull horizontally, so that we are pulling 
in the same direction as we want the object to move. Then we would have 8 = 0 and cos 6 
= 1 so that the work done = a*b = |a||b| = the magnitude of the force multiplied by the 
distance moved in the direction of the force. 
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^^^^ Each of the magnitudes |a| and |b| is a number and cos 0 is a number, so a • b 
^H-^V is not a vector but a number or scalar. This is why it's called the scalar product. 

When writing down two vectors multiplied in this way, you must include the 

dot between them. Writing ab is meaningless. 



ll.B.(b) Working out the dot product of two vectors 

We'll start by looking at the simplest cases. What will happen when we multiply pairs of the 
unit vectors i, j and k together, using the dot product rule? Work out for yourself what the 
answers are to i • i, i • j and i • k. 

If you have done this you will see that the complete set of results is as follows: 



i*i = j*j = k«k=l 

and 

i*j = j*i = i*k = k*i = j*k = k*j = 0. 



This is because i, j and k are all one unit in length and are mutually perpendicular. We are 
also using cos 0° = 1 and cos 90° = 0. 

There is a neat way of working out the dot product of any two vectors when they are given 
in component form. To do this we shall need to use the following result. 



(a + b) • c = a • c + b • c for any three vectors a, b and c. 



This is what mathematicians call the distributive law. They have shown that, for this kind of 
multiplication, it doesn't matter whether we do the adding or the multiplying first. 

Now suppose we have two vectors a = a r i + aj + a 3 k and b = 6 x i + + b 3 lL. 

Then a • b = (flji + a 2 j + a 3 k) • {b Y i + bj) + 63k). 

Working out all the nine separate little dot products that we get from multiplying these 
two brackets together and using the results above, gives us the following rule. 



If a = ci]\ + aj + fl 3k and b = + + b 3 k then 
a • b = |a||b| cos 9 = a 1 b l + a 2 b 2 + a 3 b 3 . 



Here are some examples of applications of the dot product. 

example (1) We will show that the vectors a = 3i + 5j -2k and b = 2i - 2j - 2k are 
perpendicular or orthogonal to each other. 

Finding their dot product gives us a neat way of doing this. Using 
the rule above, we have 

a • b = |a||b| cos 6 = (3 x 2) + (5 x -2) + (-2 x -2) = 6 - 10 + 4 = 0. 
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Since neither |a| nor |b| = 0 we have cos 0 = 0 so 0 = 90° and a and b 
are perpendicular. 

You may find it helpful to write your vectors as column vectors 
when you work out a dot product, as the pairs to be multiplied are then 
very conveniently placed. Here, we would have 



(3 x 2) + (5 x -2) + (-2 x -2) = 0. 



You may recognise that this works in the same way as the matrix 
multiplication 



10 + 4 = 0. 



example (2) Find the angle between the vectors a = 2i - 3j + 2k and b = 4i + 2j - 3k. 
Using the method found in Section ll.A.(e) we have 




|a| = V2 2 + (- 3) 2 + 2 2 = Jvj = 4.123 to 3 d.p. and 
|b| = V4 2 + 2 2 + (- 3) 2 = ^29 = 5.385 to 3 d.p. 

Also we have a • b = (2 x 4) + ( - 3 x 2) + (2 x -3) = -4. 

Since a • b = |a||b| cos 6, we now have - 4 = 4.123 x 5.385 x cos 6 
so cos 9 = -0.1802 and 0 = 100° measuring to the nearest degree. 
Notice that the negative sign automatically feeds you the information 
that the angle between the vectors is obtuse. 



If you need to find the angle between two vectors, it's always worth checking 
first whether the vectors are parallel before working out the dot product. For 
example, if you had been given a = 2i + j - 3k and b = 6i + 3j - 9k then b = 
3 a showing that the vectors are parallel and in the same direction, so the angle 
between them is 0°. If we'd had b = -6i - 3j + 9k, so that b = -3a, again the 
vectors are parallel but in opposite directions so the angle between them is 
180°. 



example (3) The position vectors of the points A, B and C with respect to the origin 
O are a = i + 2j + 5k, b = 4i - 3j + 9k and c = 6i - j + 10k 
respectively. Show that the triangle ABC is right-angled and find its 
area. 

^^^^ Students quite often make the mistake of thinking that a, b and c are the sides 
^H-^V of the triangle ABC but they are the position vectors of A, B and C from the 
origin. 

I've shown this in Figure 11.B.2. 



11. B Multiplying vectors 



483 



Figure 11.B.2 



We have to start by working out the vectors which represent AB, BC and 
CA. I will use column vectors for the working in this example. We have AB 
= -a + b, BC = -b + c and CA = -c + a. This gives 



AB = 

BC = 
and 

CA = 



To find CA, you could also have used CA = CB + BA. Check this for 
yourself. 

Now we work out the dot product of AB and BC. We get 





so AB and BC are perpendicular. We struck lucky with our choice of sides 
here! 

The area A of the triangle ABC is given by ||^5||i?C| since it is right- 
angled. 

We have A = + 25 + 16 ^4 + 4 + 1 = 10.6 square units to 3 s.f. 



example (4) Find a unit vector which is perpendicular to the vectors a = 4i + 2j + 
3k and b = 5i - 2j + 6k. 

We'll start by concentrating on finding a vector which is 
perpendicular to the two given vectors. Then we'll convert this into a 
unit vector. 

Suppose the vector n = ui + vj + wk does what we want. Then we 
know that a*n = b*n = 0. This gives us the following two equations. 
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and 




= 0 so 4u + 2v + 3w = 0 











i-i 











= 0 so 5u - 2v + 6w = 0. 



A vector perpendicular to a and b can be any length. You can imagine a 
and b lying in the plane of a table with us looking for any vector which 
is perpendicular to the table. This means that we certainly can't find a 
unique answer for u, v and w. So we look for any vector which will fit 
our two equations. 

Adding them gives 9w + 9w = 0 so one possibility would be to let u 
= 1 and w = -1. Then substituting these values in the first equation 
gives 4 + 2v - 3 = 0 so v = -5. 

Whole numbers are easier to work with so we'll let n = 2i - j - 2k, 
multiplying all our previous values by 2. Now we find n by finding |n| 
and dividing by it. 

|n| = ^4 + 1 + 4 = 3 so n = |(2i - j - 2k). 

Notice that n = -|(2i - j - 2k) would have answered the question 
equally well. In Example (2) in Section ll.B.(d), we'll discover an 
alternative way of finding a vector which is perpendicular to two given 
vectors. 

We shall find the dot product very useful when we are working with 
lines and planes in Section ll.C. 
Now try these questions. 



exercise 11.B.1 (1) Find the angles between the following pairs of vectors, giving your answers in 

degrees to 1 d.p. if they aren't exact. 



(a) 


a = 


2\ 


- 3j" - k and b = 


5i + j + 2k 


(b) 


a = 


21 


- 7j' + 3k and b 


= 6i - 2ij + 9k 


(c) 


a = 


41 


- 6j + 5k and b 


= -3i + 3} + 6k 


(d) 


a = 


31 


- 5j + 2k and b 


= i + j - 3k 


(e) 


a = 


21 


- j + 3k and b = 


: -8i + i\\ - 12k 



(2) The position vectors of the points A, B and C with respect to the origin 0 are 
a = 8i - j + 8k, b = i + 4j + 5k and c = 4i - 3) + 6k respectively. 

Show that the triangle ABC is right-angled and find its area. Try to answer this 
question without looking at Example (3) above. Read it through again before 
you start if you want to. 

(3) Find unit vectors which are perpendicular to the following pairs of vectors: 

(a) a = 4j + 3k and b = 6i + 4j + 7k 

(b) a = -i + 4j and b = 21 - 4j - 3k 

(c) a = 31 + 2j and b = 3j + 4k 
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ll.B.(c) 



Defining the vector or cross product of two vectors 



Is it possible to find a way to multiply two vectors together so that we get a vector result? 
The problem is that, working from the two directions of our starting vectors, we have to find 
some way of using these to give us the single direction of the resulting vector. This problem 
makes it impossible to define this kind of vector multiplication in two dimensions but there 
is a very neat way of solving it in three dimensions. It works like this. 

We first slide the two vectors together, if necessary, so that their tails meet. Now they 
both lie in one particular flat surface or plane. We can use the direction perpendicular to this 
plane to give us the direction of our vector product. We only have to decide which of the two 
possible perpendicular directions to choose. 

Figure 1 1.B.3 shows how we do this to find the vector product of the two vectors a and 
b. I've drawn b both in its original position and also shifted so that the tails of a and b 
meet. 



Figure 11.B.3 

We define the vector product of a and b as 



a x b = |a||b|sin 6 n 



where n is a unit vector perpendicular to the plane in which a and b lie. 

We choose the direction of n so that a, b and n can fit along the thumb, first finger and 
second finger of your right hand. (Check this with your hand and my drawing.) Notice that 
this means that the direction of b x a is given by -n so a x b = -(b x a). Notice, also, that 
the definition means that a x a = the zero vector Oi + Oj + Ok since sin 0° = 0. 

The multiplication sign used to show the vector product is called 'cross'. It is also 
sometimes written using a little upside down v so a a b is the same as a x b. 

The cross product has the following three very useful practical applications. 

Application (1) area 

We'll look at how cross products give the areas of parallelograms and triangles. 

The area of a parallelogram is equal to its base multiplied by its perpendicular height. 

In Figure 11.B.4 we have a parallelogram whose base is |b| and whose perpendicular 
height h is given by h = |a| sin 6 where |a| is the length of the slant side. 

But axb = |a||b|sin0n where n is the unit vector perpendicular to a and b. 
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Figure 11.B.4 

Therefore its area A is given by the magnitude ofaxbor^4 = |axb|. From this result, 
we see that the area of a triangle with base of length |b| and slant side |a| is given by 
j|a x b|. The cross product of any two vector sides of a triangle can be used to find its 
area. 

Application (2) torque 

The vector product gives us a way to define the torque or moment of a force about a point. 
Figure 11.B.5 shows how this works. 




Figure 11.B.5 



If we have a force F acting through a point P with position vector r with respect to O, 
then F and r lie in a plane through O. I have drawn n as a unit vector perpendicular to this 
plane and I have also redrawn the force vector F shifted so that its tail is at O. The dashed 
line shows the line of action of the force F. 

The torque or moment of F about an axis through O perpendicular to this plane is given 

by 



torque = T = r x F = |r||F| sin 0 ft. 



You can think of |T|, the magnitude of T, in two ways. We have |T| = |F| multiplied by the 
perpendicular distance of O from the line of action of F or we also have |T| = |r| multiplied 
by the component of F perpendicular to r. I've shown both these interpretations on my 
diagram. 

Torque measures the turning effect of F about the axis through O perpendicular to the plane 
in which r and F lie. It is independent of the position of P on the line of action of F. You can see 
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that this must be so from Figure 1 1 .B.6, looking down at the plane containing the line of action 
of F and the origin and showing the position vectors of three different choices of point on this 
line of action. (Again I've shown the line of action of F as a dashed line.) 




Figure 11.B.6 



We have |p| sin P = |q| = |r| sin J?, so the torque of F about the perpendicular axis 
through O is independent of the point chosen on the line of action of the force. 

You can also see that the value of sin 6 does indeed give the various practical possibilities, 
such as when a person applies torque using a spanner to turn a nut. You get maximum turn 
if you pull it at a right angle with 6 = 90° and sin 9 = 1. If you pull along the line of the 
spanner then 0 = 0° and sin 9 = 0 and you get no turn at all. 

Application (3) angular momentum 

The vector product also gives us a way to define angular momentum. In Figure 1 1.B.7, I've 
shown a particle of mass m with position vector r relative to an origin O. If this particle has 
velocity v then its momentum p is given by p = m\. 

The angular momentum or moment of momentum of the particle is defined as 



angular momentum = r x p = r x my = m(r x v). 



angular momentum 

A 




Figure 11.B.7 

I've drawn n as a unit normal vector to the plane in which r and v lie. We have r x p = 
mix X v) = w|r||v| sin 9 n. 
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These three applications show that not only does this definition of vector multiplication 
make sense mathematically but it also has extremely useful physical meanings. 

ll.B.(d) Working out the cross product of two vectors 

Again we'll start by looking at the simplest cases. What answers do we get if we work out 
the cross products for the different possible pairs we can choose from the unit vectors, i, j 
and k? Work out for yourself what i x i, i x j and j x i give. 

If you have done this, you will see that the complete set of results will be as follows: 



ixi = jxj = kxk = the zero vector. 

(Remember that the zero vector is the vector Oi + Oj + Ok.) 

Also, i x j = k and j x k = i and k x i = j 

while j x i = -k and k x j = -i and i x k = -j. 



This is because sin 0° = 0 and sin 90° = 1 and each vector is of unit length. You can see how 
the various plus and minus signs come by using the right-hand rule for each product. 

Next we see how we can work out the cross product of two vectors by using their 
components. Fortunately, mathematicians have shown that the distributive law is true for the 
cross product. That is, if we have three vectors a, b and c, then (a x c) + (b x c) = (a + b) 
x c. 

Now suppose we have a = a x \ + + 03k and b = b y \ + b 2 \ + 63k so that a x b = 
(flji + aj + a 3 k) x + + 63k). Working out all the nine separate little cross products 
that we get from multiplying these two brackets together and using the distributive law from 
above gives us the following rule. 



If we have two vectors a = a x i + a 2 \ + a 3 k and b = b x \ + bj} + b 3 k 

then their vector or cross product is given by 

a x b = (a 2 b 3 - a 3 b 2 )i + (a 3 b l - a^j + (a x b 2 - a 2 b x )k. 



It can be helpful when working out a vector product to set out the two vectors in the form 
of what is called a determinant. To do this we write 



a x b = 



i j k 

a 1 a 2 a 3 
b x b 2 b 3 



The rule for multiplying out this determinant is that we start from the top row and multiply 
down, taking an element from each row in all possible ways except that no element is 
allowed to be directly underneath another. The sign of each multiplication depends on the 
order in which the components are chosen. I've shown how the working out goes by writing 
the determinant in a double form in Figure 11.B.8. 
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<j k 



i k 



V\ X. / / 



a., a. 



V 3 '< 



a. <x z 



Figure 11.B.8 



We put together all the diagonal multiplications with the forwards slanting ones being 
positive and the backwards slanting ones, shown with dashed lines, being negative. This 
gives us the answer 

a x b = i(a 2 b 3 ) - \{a 3 b 2 ) + j(a 3 ^i) - j(«i^ 3 ) + kOi^i) - k(a 2 *i) 
which, tidied up, is 

a x b = (a 2 b 3 - a 3 b 2 )\ + (a 3 b l - a l b 3 )) + (a x b 2 - a 2 fe 1 )k 
as we found before. 



exercise 11.B.2 Work out the cross product a x b for each of the following pairs of vectors a and b. 

(1) a = i + j and b = j + k 

(2) a = 31 + j + 2k and b = si + k 

(3) a = 2i - 7j + 3k and b = i - 2] - k 

(4) a = i - 2] - k and b = 21 - j] + 3k 

(5) a = si - 2k and b = 31 + 4k 



Here are two more examples which make use of the cross product. 



example (1) A force F = 3i + 2j + 4k acts through the point with position vector 
r = 2i + j + 3k. What is its torque about an axis through O which is 
perpendicular to both r and F? 

The torque = rxF = (lx4-3x2)i + (3x3-2x4)j + (2x2- 
1 x 3)k = -2i + j + k. The method is the same in whatever system of 
units you are working. This is very much easier than trying to work out 
the moment of a force in three dimensions using geometry. 



example (2) Find a unit vector which is perpendicular to the two vectors 
a = 4i + 2j + 3k and b = 5i - 2j + 6k. 

Working out a X b will automatically give us a vector which is 
perpendicular to both a and b. To do this we write 



a x b = 



i j k 

4 2 3 
5-2 6 



= i(12 + 6) + j(15 - 24) + k(-8 - 10) = 18i - 9j - 18k. 

Dividing by 9, the vector v = 2i - j - 2k will work equally well. But we 
want a unit vector so we must divide by the length of this vector. Now 
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I v I = y]4 + 1 + 4 = 3 so the required unit vector is v = l/3(2i - j - 2k). 
This method is easier than when we used the dot product in Example 
(4) in Section ll.B.(b) to find this same vector. We shall need to find 
unit vectors which are mutually perpendicular to two given vectors when 
we work with equations of planes in Section ll.C.(h). 

exercise 11.B.3 (l) Find unit vectors which are perpendicular to each of the following pairs of 

vectors. 

(a) a = 4j + 3k and b = 6i + 4j + 7k 

(b) a = -i + 4j and b = 21 - 4j - 3k 

(c) a = 31 + 2j and b = 3] + 4k 

(d) a = 3i + 2j + 2k and b = -3i + j + 4k 

(e) a = 2i + 2k and b = 2i + 4j - 5k 

(f) a = 4i + 3] + 2k and b = —41 + 3j 

(2) The three forces F 2 and F 3 all act on a rigid body. 
Fj = 2i - j + 3k, F 2 = 3i + 4j - 4k and F 3 = i - 2j + 2k. 

F 1 acts through the origin, F 2 acts through the point with position vector 
i + 2] + k, and F 3 acts through the point with position vector i - 3k. 
Find their resultant force, R. Find also the total moment of these three forces 
about the origin. The different lines of action of the forces don't affect the 
calculation of R but we have to know them to calculate the moment of the 
three forces about 0. 

(3) If a = xi + y'\ + zk, b = 2i - j + k and c = i - 3j — 5k and a x b = c can you 
find the values of x, y and z? 

(4) Starting with the vector u = 21 - j - 2k, find three mutually perpendicular unit 
vectors, with one of them in the direction of u. (Hint: think what is happening 
geometrically here. There are an infinite number of vectors perpendicular to u 
and any of these will do for your second vector. Also it's easier to find the 
mutually perpendicular vectors first and then convert them into unit vectors. 
Perpendicular vectors are also called orthogonal vectors.) 

ll.B.(e) Can we multiply three vectors together by using dot or cross products? 

Our options are limited because the result of the first multiplication must be a vector 
otherwise the second multiplication isn't defined. So, for example, we can't work out (a • 
b) • c or c x (a • b). (The brackets tell us which pair to multiply first.) Therefore the first 
multiplication which we do must be a cross product. The second multiplication can then be 
either a cross product or a dot product. We'll look at these two possibilities separately. 

ll.B.(f) The vector triple product 

The multiplication (a x b) x c is an example of what is called the vector triple product. You 
might think that (a x b) x c = a x (b x c) but, in general, this is not so. You can understand 
this if you imagine the vectors a and b lying in the plane of a table. Then d = a x b has a 
direction perpendicular to this table. Since d x c is perpendicular to d, working out d x c puts 
you back in the table again. Similarly, the vector a x (b x c) lies in the same plane as b and 
c. In general, this won't be the same as the plane in which a and b lie. 

Here's a numerical example to show how this works. If a = i + k and b = 2j + 3k and 
c = 2i + j + k, then working out u = (axb)xc gives us u = -5i + 6j + 4k. You can see that u lies 
in the plane of a and b because it can be written as a linear combination of a and b. Check for 
yourself that u = -5a + 3b. (You might need to look back at Section 1 l.A.(d) here.) 
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Similarly, working out v = ax(bxc) gives v = -6i + 3j + 6k and v lies in the plane of 
b and c because v = 3b - 3c. 

exercise 11.B.4 (1) Try working out a similar example for yourself. 

If a = i + 2], b = -j + k and c = 21 + j - 2k, first find u = (a x b) x c and show 
that u lies in the plane of a and b. 

Then work out v = a x (b x c) and show that v lies in the plane of b and c. 
(2) For any three vectors a, b and c, it can be proved that a x (b x c) = (a • c)b - 
(a • b)c. Check that this formula works both for my numerical example before 
this exercise and for your answer to the first question of this exercise. 

ll.B.(g) The scalar triple product and what it means geometrically 

Now we look at multiplications in which we first find a cross product and then find the dot 
product of this with some third vector. So, for example, we might find (a x b) • c. 
Multiplications like this are called scalar triple products. The working out will always result 
in a scalar or number. 

The scalar triple product has a useful geometrical interpretation. To show you how this 
works, we'll take the numerical example ofV= (2i x 3j) • 4k. This particular scalar triple 
product gives the volume of a rectangular box. Now we will see why this is so. 

First, we work out the cross product of 2i X 3j = 6k. Figure 1 1.B.9 shows you that this 
6k gives the size of the area of the base of the left-hand box, but as a vector in the k 
direction. 




Figure 11.B.9 

Next we work out the dot product of 6k • 4k = 24, which gives us the actual volume of 
the left-hand box. 

Similarly, (3j x 4k) • 2i gives the volume of the right-hand box. This is the same size but 
we are starting with the left-hand face to find its volume. We have 

(3j x 4k) • 2i = 12i • 2i = 24. 

In general, given three vectors a, b and c, we can always find the scalar triple product of 
(a x b) • c and the answer will be a number, not a vector. 
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Also, (a x b) • c = a • (b x c). Both of these multiplications give the volume of the slant- 
sided box made by sliding the three vectors a, b and c together so that their tails meet. (This 
box is called a parallelepiped.) 

I show how this works in Figure 1 1 .B. 10. The volume of this box is given by the area of 
its base multiplied by its perpendicular height. I've shown the base separately on the right- 
hand side of the diagram. 




Figure 11.B.10 

We have a x b = |a||b| sin J? n = Ah where n is a unit vector perpendicular to the 
parallelogram with sides given by a and b, A is its area and R is the angle between a 
and b. 

So (a x b) • c = Ah • c = J|n||c| cos S where 5* is the angle between c and n. 

But | n | = 1 and |c| cos S = h which is the perpendicular height of the box. 

Therefore (axb)*c=ili which is the volume of the box whose edges are the vectors 
a, b and c. Equally, (b x c) • a and (c x a) • b give the volume of the same box, so we 
have 

(a x b) • c = (b x c) • a = (c x a) • b = c • (a x b) = a • (b x c) = b • (c x a). 

The last three are the same as the first three since the order in which we work out a dot 
product leaves the answer unchanged. However, if we interchange the order of the vectors 
in the cross product, the answer will change in sign since a x b = -(b x a). 

exercise 11.B.5 If a = 31 - j + 2k and b = i - 2j + 2k and c = 21 + 4j - k, work out each of the 

following: 

(1) (a x b) • c 

(2) a • (b x c) 

(3) b • (a x c) 

(4) The three vectors a, b and c all lie in the same plane. What happens when you 
work out (a) (a x b) x c? (b) (a x b) • c? 

ll.C Finding equations for lines and planes 

The results we have found in the previous two sections mean that we shall now be able to 
find vector equations for straight lines and planes. We start by considering lines. 

ll.C.(a) Finding a vector equation for a line 

Using vectors gives us a very neat way of writing down an equation which gives the position 
vector of any point on a given straight line. This method works equally well in two or three 
dimensions. 
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Suppose we've got a straight line like the one I show in Figure ll.C.l. (You have to 
imagine that it extends infinitely far in either direction.) O is the origin and A is a known 
point on the line. P is any general point on the line. 




o 



Figure ll.C.l 

In order to write down the vector equation of this line, we need to know two things. 

• We have to know the position vector of some point which lies on the line. On my diagram, 
we know that OA = a. 

• We have to know a vector which gives the direction of the line, like d in my diagram. This 
is called a direction vector. 

Then the position vector r of any general point P on the line is given by the equation 



r = a + td 



where t tells us how much of d we need to take in order to get from A to P. (t = 2 for the 
particular P I have shown in my drawing.) 

^B^B Notice that writing r = la + d would give you a completely different line! 

™ 

It's important to realise that there are many possible ways of writing the vector equation 
of any given line. So, in my example above, any points on the line would work equally well 
provided we knew its position vector, and any vector lying parallel to d would work equally 
well as a direction vector. For example, I could have used 3d or -d. 

ll.C.(b) Dealing with lines in two dimensions 

If we are working in two dimensions we can think of any line as lying in a plane containing 
the origin where all the vectors can be described in terms of just i and j . It is as though we 
are looking at the line in an infinitely large piece of graph paper. 

Suppose that we have a line L with the equation r = a + td with a = 2i + j and d = i - j. 

Letting t = 0, 1 and -2 in turn, we find that the points with position vectors r = 2i + j, 
r = 3i and r = 3j all lie on this line. I show a sketch of it in Figure 11.C.2. 
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Figure 11.C.2 



Writing any general point on the line as r = xi + yj, and using column vectors, we 
have 



Vectors are only equal if their components are equal so we must have x = 2 + t and y = 



Adding these equations to get rid of t, we find that x + y = 3. This is the Cartesian form 
of the equation of this line. It's now in the form described in Section 2.B.(f). 

We can also work backwards to get a vector equation of a line from its Cartesian 
equation. 

Suppose we have the line 2y = x + 4. First, we find a possible a, the position vector of 
a known point on the line. Any such point will do, so I'll choose the point (2, 3) with position 
vector a = 2i + 3j. 

Now we need a direction vector for the line. Its gradient, working as in Section 2.B.(d), 
is 1/2 so its direction is given by the vector d = 2i + j. 

Therefore we can write its vector equation as r = 2i + 3j + f(2i + j). 

Notice that we have a huge choice as to how we write a vector equation for this line. It 
would have worked equally well with a = -2i + j and d = -2i - j. I've shown both 
possibilities in Figure 11.C.3, the first using solid lines for a and d and the second using 
dashed lines. 




1 - t. 



*3 



? 



-4 




Figure 11.C.3 
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We can see that the point I've marked P, with p = 4i + 4j, lies on this line. 
If we use r = 2i + 3j + t(2i + j) then we put t = 1 to get r = p. 
If we use r = -2i + j + t(- 2i - j) then we put t = -3 to get r = p. 
Looking at the graph you can see that the vectors do actually work like this. 

exercise li.c.i (1) Find a cartesian equation for each of the following lines: 

(a) r = 2i - 3j + r(i + 2j) 

(b) r = i - 2j + r(4j) 

(2) Find a vector equation for each of the following lines: 

(a) y = 3X + 5 

(b) 2y = 5X - 3 

If we have two straight lines in two dimensions then they must either be parallel, in which 
case their two direction vectors must be parallel, or they must cut each other at some 
point. 

Here's an example. Suppose we have three lines L x , L 2 and L 3 and we need to find the 
relationship of L x with each of L 2 and L 3 . 

Line L x has the equation r = i + 3j + t(- i + 2j). 

Line L 2 has the equation r = 2i - j + m(2i - 4j). 

Line L 3 has the equation r = i - 2j + s(2i + j). 
The letters s and m work in exactly the same way for their lines as t does for line L x . 
Lines L x and L 2 are parallel because their direction vectors are parallel since 2i - 4j = 
-2(-i + 2j). 

Lines L x and L 3 aren't parallel so now we'll find where they meet. Suppose that they meet 
at the point P with position vector p. Then, working with column vectors, we have 




Vectors are equal if their components are equal so we have the following two equations. 

l-t=l+2s (1) 
3 + 2t = -2 + s (2) 

From (1) we have t = -2s. Substituting this in (2) gives 3 - 4s = -2 + s so s = 1. Therefore 
t = -2. The two lines meet at the point with position vector p = 3i - j. 

Notice that you can either substitute t = -2 in L x or s = 1 in L 3 to get this result. 

If we tried to find where the parallel lines L x and L 2 meet using this method, we wouldn't 
be able to solve the equations for t and m. See for yourself! 

Try these two questions. 

exercise 11.C.2 (1) Find the point of intersection of the lines r = i + j + sfei - 2j) and 

r = 6i + r( 2 i + j). 

(2) The line L r has the vector equation r = i + 3j + f(2i + 3j). Find the vector 
equation of the line L 2 which is perpendicular to l x and which passes through 
the point whose position vector from the origin is r = 3i + 4j. 



496 



Working with vectors 



ll.C.(c) Dealing with lines in three dimensions 

The general equation for a straight line of r = a + td is unchanged but now both a and d are 
three-dimensional vectors. Here are some examples of equations of particular lines so we 
can look in more detail at how they actually work. 

Suppose that Line L x has the equation r = i + 3k + t(2i + j + k) so that the vector a in 
Figure ll.C.l is i + 3k and the vector d is 2i + j + k. 

Different values of t give the position vectors of different points on this line. 

Putting t = 0 gives r = i + 3k. 

Putting t = 1 gives r = 3i + j + 4k. 

Putting t = -\ gives r = — i — j + 2k. 
Suppose there are two more lines L 2 and L 3 so that now we have the following three 
lines. 

L x has the equation r = i + 3k + ?(2i + j + k). 

L 2 has the equation r = i + 3k + s(i + 4j - k). 

L 3 has the equation r = i + j + k + m(4i + 2j + 2k). 
The letters s and m work in exactly the same way for their lines as t does for line L x . 
Both the lines L 2 and L 3 have special relationships with line L x . Can you spot what they 
are? 



Comparing lines L x and L 2 with the general form of the equation for a straight line of 
r = a + td, we see that both lines have the same position vector of a = i + 3k but they have 
different direction vectors. Therefore they cut each other at the point with position vector 
r = i + 3k. 

Line L 3 has the same direction as line L x since its direction vector is just scaled up by a 
factor of 2. So either lines L x and L 3 are parallel or they are really the same line. 

Putting t = 0 for line L x gives r = i + 3k but there is no value we can give to m in line 
L 3 which would make r = i + 3k. Therefore L x and L 3 are distinct parallel lines. 

Now we'll consider two more lines. 

Line L 4 has the equation r = i - j + 4k + s(i - j + k). 
Line L 5 has the equation r = 2i + 4j + 7k + t(2i + j + 3k). 

They are not parallel since their direction vectors aren't parallel but do they cut each 
other? If not, they are what are called skew lines. 

If they do cut each other then the point P where they cut must lie on both lines. We'll call 
its position vector from the origin p. 

The point P can only exist if there are values of s and t so that 

p = i - j + 4k + s(i - j + k) = 2i + 4j + 7k + t{2\ + j + 3k). 
The working is easier if we write this equation using column vectors. Doing this gives us 



P = 



For this equation to have a solution, each of the three components of the vectors must be 
equal, so we get 




1 


+ 5 = 2" 


1- It 


(1) 


-1 


- s = 4 4 


- 1 


(2) 


4 


+ s = 7- 


h 3? 


(3) 
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Is this possible? If we add (1) and (2) we get 0 = 6 + 3t so t = -2. Substituting this value 
in (1) gives s = -3. 

The next step is very important. The lines only meet if these values of s and t also fit 
equation (3). 

Substituting s = -3 and t = -2 in 4 + s = 7 + 3t gives the LHS = 4 - 3 = 1 and the RHS 
= 7 - 6 = 1 . Therefore the three equations are consistent (that is, there is a solution which 
fits all three of them) and the lines do cut each other. 

Now, putting s = -3 we find that the position vector p of the point of intersection P is 
p = -2i + 2j + k. You will see that putting t = -2 gives exactly the same result. 

Try some for yourself now. 

exercise 11.C.3 Find whether the following pairs of lines are (a) parallel, (b) non-parallel and 

intersecting, or (c) non-parallel and non-intersecting and therefore skew. 

(1) r = i + 2j - k + s(2i - j + 3k) and r = 5! — j + f(6i - 3j + 9k) 

(2) r = -i - j + 5k + s(2i - 3k) and r = 41 - 4j + 2k + f(i - j) 

(3) r = 2i + k + s(i + 3j + 4k) and r = i + 3j + r(2j + k) 

(4) r = i + 3j + 2k + s(i - j + 2k) and r = 2i + 6j + 4k + r(i - 3j + 2k) 

(5) r = i - 2j + k + s( - j + 2k) and r = 3i - 6j + 3k + f (21 + 2j - k) 

ll.C.(d) Finding the Cartesian equation of a line in three dimensions 

We know that an equation like 3x + Ay = 12 describes a line in two-dimensional space. Will 
it also describe a line in three-dimensional space? In Example (2) of Section ll.C.(i) we 
shall show that it actually describes a plane. So, how would we write the equation of a line 
in three dimensions in terms of x, y and z? 

To show you how the working goes, I'll take the particular example of the line whose 
vector equation is r = 2i + 3j - 4k + r(3i - j + 2k). 

Since r is standing for the position vector from O of any point on the line, we'll write 
r = xi + y\ + zk. The values of x, y and z vary as the point P moves on the line. So now we 
can say 

r = xi + y\ + zk = 2i + 3j - 4k + t (3i - j + 2k). 
The working is easier if we write this equation using column vectors. This gives 











f I 




- 



















This equation can only be satisfied if it is true for each separate component. We have 

'x = 2 + 3? (1) 
■ y = 3-t (2) 
. z = -4 + It (3) 

Rearranging each equation for t gives us 

x-2 y-3 z + 4 

= t and = t and = t. 

3 - 1 2 
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Therefore we can say 

x - 2 z + 4 

= 3-y = . 

3 2 

This is the equation of the line in terms of x, y and z (also called its cartesian equation). So, 
if x = 5 for example, we would have 1 = 3- ysoy = 2 and z + 4 = 2soz = -2 which means 
that the point with position vector r = 5i + 2j - 2k lies on this line. Looking back at the 
equation of the line written in column vectors above, we can see that t = 1 gives this 
point. 

Working in exactly the same way, we get this general rule. 



If a = aji + a 2 \ + a 3 k and d = d]i + dj} + d 3 k 

then the equation of the line r = a + td can also be written as 















- 






f d) 






\ a 3 ) 




\dj 



This rearranges to give the Cartesian equation of the line as 

x - a x y - a 2 z - a 3 
d x d 2 d 3 



Notice how the components of the position vector a and the direction vector d appear in the 
Cartesian equation. 

If we are finding the Cartesian equation of the line r = a + td, and either a or d or both 
have components equal to zero, the working can be a little tricky. 



Example (i) Suppose we have r = 3j + t(- 4i + 3j). 

Taking r = xi + yj + zk as before, and working with column vectors, 
gives us 















- 


• 




3 










I 0 



From this, we get the three equations 

' x = -At (1) 
j = 3 + 3f (2) 
z = 0 (3) 

Rearranging the first two equations gives 



x y - 3 

— = t and = t. 

4 3 
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So the Cartesian equation of this line is given by 

x y - 3 

- — = and z = 0. 

4 3 

So, if x = 8 for example, we have y - 3 = -6 so y = -3. 

The point r = 8i - 3j lies on this line. We can see that we get the 

position vector of this point when t = -2. 

Next we have two examples showing the reverse process of converting the Cartesian 
equation of a line into its vector equation. 

Example (2) Suppose we have the line with the equation 

x - 5 j + 3 2 - z 

2 4 3 

Then we choose t so that 

x — 5 y + 3 2 — z 

'2 4 3 

Then, working with t and each of the x, y and z expressions in turn, we 
get the three equations below for x, y and z. 

x = 5 + 2? (1) 
y = -3 + 4t (2) 
. z = 2 - 3t (3) 

Writing these equations as components in column vectors gives us 











1 2 




- 



















which is the vector equation of the line. 

This can also be written as r = 5i - 3j + 2k + t (2i + 4j - 3k). 

Example (3) Suppose we have the line with the equation 

x-3 

= z and y = -\. 



4 

x-3 

Letting t = = z gives us At + 3 = x and t = z. 

4 

We also have y = -I. 

Writing these equations as components in column vectors gives us 











< 4 




- 






0 













which is the vector equation of the line. This can also be written as 
r = 3i - j + t(4i + k). 



Now try these questions yourself. 



500 



Working with vectors 



exercise 11.C.4 (1) Find the Cartesian equation for each of the following lines. 

(a) r = i - 2j + 5k + f (31 + 2] - jk) 

(b) r = 31 + 5k + f (21 - j + k) 

(c) r = 2] - 3k + f(i - k) 

(2) Find the vector equation for each of the following lines. 

, , *-5 y+ 3 * 

(a) = = - 



y- 2 

(b) x = = z + 1 

3 

(c) 3X = 5 - y and z = o 

(3) A straight line in three-dimensional space passes through the points A and B 
with position vectors relative to the origin of a = 4i and b = 3]. 

(a) Find its vector equation. 

(b) Find its Cartesian equation. 

ll.C.(e) Another form for the vector equation of a line 

We can use the cross product to write the vector equation of a straight line in the following 
way. From Figure 11.C.4 (which is the same drawing that I used in Section ll.C.(a)) we 
have AP = -a + r and d is in the same direction as AP. 




o 

Figure ll.C 4 



Therefore, d x AP = 0, the zero vector. This gives us the alternative form for the equation 
of a straight line of (r - a) x d = 0 = Oi + Oj + Ok. 

ll.C.(f) Finding vector equations for planes 

One possible way of doing this is to use a very similar method to the one we used in Section 
1 1 .C.(a) to find the vector equation of a line. The difference is that now we want an equation 
which gives the position vector of any point in a flat surface or plane. We'll start with the 
case where the origin lies in the plane. Figure ll.C. 5 shows part of a plane like this. 

We've now got a very similar situation to the one described in Section ll.A.(c) except 
that this plane is a two-dimensional space embedded in a three-dimensional space. To find 
the position vector of any point P, we have to know two non-parallel vectors which lie in the 
plane. I have called these d and e. It is then possible to get to P by adding together suitable 
multiples of d and e. This gives us the equation of the plane as r = sd + ?e where the values 
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Figure 11.C.5 



of the numbers s and t can be chosen to give us the position vector of any point in the plane. 
In Figure 11.C.5, r is given by s = 1.4 and t = 1.1 approximately. 

Now suppose we have a plane which doesn't pass through the origin. Figure 1 1 .C.6 shows 
part of a plane like this. Again, d and e are known vectors which lie in the plane. But now 
we also need a way of getting to the plane from the origin, so we have to know the position 
vector of some particular point in the plane. In Figure 1 1.C.6, this point is A with position 
vector a. 




Figure 11. C.6 

Once we have reached the plane, we can find the position of any general point P relative 
to A in the same way that we did above by saying that p = sd + te. (For my particular P, 
s = 1 .2 and t = 1 .) Now we can get the equation of the plane in terms of the known vectors 
a, d and e. 

We have r = a + p but p = sd + te, so r = a + sd + te. 

In the special case where the plane passes through the origin, we can leave out the a 
because we can choose O to be the known point in the plane. If two planes are parallel, then 
the same d and e can be used for both of them, since we can move these vectors so that they 
lie in either plane. The equations of the planes are different because each one must also 
include a position vector from the origin to a known point in that particular plane. 

Writing the equation of a plane in this way has one big disadvantage. There are infinitely 
many directions which vectors lying in the plane can have. Therefore there are infinitely 
many pairs like d and e to choose from. It would be much nicer if we could use a direction 
which is unique to the plane. There is a direction which has exactly this property. Can you 
think what it is? The next section explains how we use it. 
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ll.C.(g) Finding equations of planes using normal vectors 

The direction perpendicular to a plane is unique to that plane (and any plane parallel to it). 
To see how we can use this to give us another form of the equation of a plane, we'll again 
start with the case where the plane passes through the origin. I've shown part of such a plane 
in Figure 11.C.7. 




Suppose we know a vector n which is perpendicular to the plane. This means that n 
is perpendicular to the position vector r of any point in the plane relative to the origin. 
Therefore the dot product of the perpendicular vectors n and r is zero and we can write 
the vector equation of the plane as n • r = 0. The vector n is called a normal vector to 
the plane. Any vector parallel to the n I have drawn will also be a normal vector to 
this plane and will work equally well. In particular, we could have used -n. Provided we 
know two non-parallel vectors which lie in the plane, say d and e, then we can always 
find a vector n by working out the cross product of d and e. (See Section ll.B.(d) if 
necessary.) 

Now we extend this method to find the equation of a plane which doesn't pass through 
the origin. I've shown part of such a plane in Figure 11.C.8. This time, we have to be able 
to get to the plane first from the origin, so we must know the position vector of some 
particular point in the plane from the origin. In Figure 1 1.C.8, this point is A with position 
vector a. 




Figure 11.C.8 



If P is any general point in the plane, so that the vector AP = p, then n and p are 
perpendicular to each other. Therefore n • p = 0 but p = r - a. 
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This gives us the vector equation of the plane as 



n • (r - a) = 0 or n • r = n • a = C 



where C is the number we get from working out the dot product of the two known vectors 
n and a. 

Again, if we know two non-parallel vectors d and e which lie in the plane then we can 
use their cross product, d x e, to find a vector n which is perpendicular or normal to this 
plane. 



ll.C.(h) Finding the perpendicular distance from the origin to a plane 

We can obtain a particularly useful form of the equation of a plane by finding a normal 
vector to it which has unit length. I'll call this unit normal vector n. You can see why it is 
useful by looking at Figure 11.C.9. 



o 




Figure 11.C.9 



Since |n| = 1, we have r • n = |r| cos 6 = D where |Z>| is the perpendicular distance from 
the origin to the plane. We write the distance as \D\ because distances are always taken as 
positive. If n was pointing upwards in Figure 1 1.C.8, the calculated D from r • n would be 
negative since 9, the angle between n and r, would now be obtuse. 

Knowing how to find this distance will make it easier to find some shortest distances in 
Sections ll.D.(e) and (f). It also has applications in computer graphics. 

As a numerical example, we'll find the perpendicular distance from the origin to the 
plane with equation r = 3i + k + s{2\ - 4j + 5k) + t(i - k). 

Comparing this with the general form r = a + sd + fe, we have d = 2i - 4j + 5k and 
e = i - k. 

Working out d x e will give us a normal vector, n, to the plane. We have 



n = d x e = 



i j k 

2-4 5 
1 0 -1 



= 4i + 7j + 4k. 

From this we have |n| = = 9 and the unit vector n = |(4i + 7j + 4k). 
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Now, n • r = n • a = D where \D\ is the perpendicular distance from the origin to the plane, 
and a = 3i + k. Working with column vectors, we have 



n • a = 




The perpendicular distance from the origin to this plane is 16/9 units. 



exercise 11.C.5 Find the equations of the following planes in the form r • n = D and hence find 

the perpendicular distance from the origin to the plane in each case. 

(1) r = 4i + j + s(2i - 4j - 3k) + f(-i + 4j) 

(2) r = 2i - 3j - k + s(3j + 4k) + r(3i + 2j) 

(3) r = 3j + 6k + s(3i + 2) + 2k) + f(-3i + j + 4k) 



Can you explain why you get a rather surprising result for (3)? 



ll.C.(i) The Cartesian form of the equation of a plane 

Starting with the vector equation of a plane in the form n«r = n«a = Citis very easy to 
find the Cartesian equation of the plane. We let r = xi + yj + zk, remembering that r is the 
position vector of any general point in the plane. We let n = ai + bj + ck where, for any 
particular plane, we know the values of a, b and c. Then, using column vectors, we have 













n • r = j 


: 


H 


; 


| = ax + by + cz = C. 













The Cartesian form of the equation of a plane is ax + by + cz = C. 
Here are two numerical examples. 

example (1) The plane IT contains the two vectors d = 3i - 2k and e = j + 2k. The 
point A with position vector a = 2i - j + 3k lies in this plane. Find its 
Cartesian equation and the perpendicular distance from the origin to this 
plane. 

First, we find a normal vector to the plane by working out the cross 
product of d and e. Doing this gives us d x e = n = 2i - 6j + 3k. 
The equation of the plane can be written in the form n*r = n*a = C. 
Working with column vectors, we have 




which gives us the Cartesian equation 2x - 6y + 3z = 19. 

To find the distance of the plane from the origin we must use the 
unit normal vector in the equation of the plane. Here, |n| = ^49 = 7 so 
n = j(2i - 6j + 3k) = yn. Since n = 711 and r • n = 19, we have 
r • n = 19/7 = D. The perpendicular distance of this plane from the 
origin is 19/7 units. 

example (2) The equation 3x + 4y = 12 describes a line in two-dimensional space. 
What does it describe in three-dimensional space? 
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Comparing 3x + Ay = 12 with the Cartesian equation for a plane of 
ax + by + cz = C we see that 3x + Ay = 12 describes the plane with a 
normal vector of n = 3i + 4j. For each pair of values which x and y can 
take to satisfy the equation 3x + Ay = 12, z can take any value and the 
equation will still be satisfied. So, for example, each of the points P, Q 
and R with position vectors p = (0, 3, 0), q = (0, 3, A), and r = (0, 3, -1) 
will lie on this plane. I have used row vectors here to highlight the role 
each number is taking in satisfying the equation 3x + Ay = 12. 

The reason that z can have any value is that the z-axis is parallel to 
the plane 3x + Ay = 12. I've shown part of it in Figure 1 1.C.10. 




Figure 11.C.10 



You can see that the points P, Q and R will all lie on the straight line 
AB in which the plane 3x + Ay = 12 cuts the vertical plane of x = 0. 
Similarly, the line CD is where the plane 3x + Ay = 12 cuts the 
horizontal plane of y = 0. It contains points on the plane with position 
vectors such as (A, 0, 3) and (4, 0, 1). 

Finally, suppose that you slice through this three-dimensional 
diagram to get the flat two-dimensional surface containing the x- and 
j-axes. The slant line EF in which the plane 3x + Ay = 12 cuts this flat 
surface is the line whose equation is 3x + Ay = 12 in this 
two-dimensional space. The solution to Question (3) in Exercise 11.C.4 
gives the equation of this same line in three-dimensional space. 

exercise n.c.6 Try these questions yourself now. 

(l) Find the perpendicular distance from the origin to each of the following planes. 

(a) 3x + 2y - 6z = 6 

(b) 8x - 15Z = 12 

(c) 2X - y + 2Z = 7 

(d) 4X - 7/ - 4Z = 18 
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(2) The plane n contains the three points P, Q and R with p = -i + 3j, 
q = i + 2j + 6k and r = i + 12k. Find its cartesian equation. 



11.C.Q) Finding where a line intersects a plane 

Suppose we have the line L with equation r = 4i - j + 2k + r(i + 2k) and the plane n with 
equation 3x - 2y + 5z = 11. We want to find the point P where the line L cuts the plane II 

First, we check that the line and the plane aren't parallel. The direction vector for the line 
is d = i + 2k. The normal vector to the plane is n = 3i - 2j + 5k. Working out the dot product 
of d and n gives us 13 so d and n aren't perpendicular to each other. Therefore the line and 
the plane aren't parallel. 

Suppose that line L cuts the plane II at the point P with position vector p and p = 
xi + yj + zk. 

P lies on the line so, working with column vectors, we have 







1 4 




hi 












But P also lies in the plane, so these values for x, y and z must also satisfy the equation of 
the plane. Therefore we can say 

3(4 + t) - 2(-l) + 5(2 + 2t) = 11. 

Solving this equation gives us t = -1 so the point of intersection of the line and the plane 
has the position vector p = 3i - j. 



exercise 11.C.7 Try these questions yourself. 

Find the point of intersection, if it exists, for each of the following pairs of lines 
and planes. 

(1) Line L with the equation r = -2i + sj + k + f fei - j + k) and plane IT with the 
equation 2x + - up = 3 

(2) Line L with the equation r = 31 — sj + 4k + f (2i + 3] - 4k) and plane n with 
the equation 2x + 3/ - 4Z = 4 

(3) Line L with the equation r = i - 4k + r(2i - 3] + k) and plane n with the 
equation 4* - sy + z = 12 

(4) Line L with the equation r = i + 2] - 5k + r(2i - 3] + k) and plane n with the 
equation 2x + 57 - 3Z = 6 



ll.C.(k) Finding the line of intersection of two planes 

Any two planes will intersect in a straight line unless they are parallel. If they are parallel 
then their normal vectors will also be parallel. It's always wise to check this before looking 
for a line of intersection. 

Suppose that plane Il 1 has the equation x - 2y + 3z = 6 and plane Yl 2 , has the equation 

2x + 3y - 5z = 5. 

Do these two planes intersect each other? If so, what is the equation of their line of 
intersection? 

Hi has the normal vector n x = i - 2j + 3k and H 2 has the normal vector n 2 = 2i + 3j - 
5k. Since n t and n 2 aren't parallel, the planes do intersect. 
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Let the equation of the line of intersection be r = a + td. Since this line lies in both planes 
it must be perpendicular to both iij and n 2 . Therefore the cross product of n x and n 2 will give 
us a direction vector d for this line. 

Working this out, we get n t x n 2 = d = i + 1 1 j + 7k. Check this for yourself. 

To find a, we need to find the position vector of a point which lies on both planes and 
therefore on their line of intersection. Any such point will do. One way to find such a point 
is to put z = 0 in the equations for both planes, and then solve the two resulting simultaneous 
equations for x and y. This gives us 

| *-2y = 6 (1) 

I 2x + 3y = 5 (2) 

The solution of these two simultaneous equations is x = 4 mdy = -1. (See Section 2.C.(b) 
if necessary.) 

Therefore the points with coordinates (4, - 1, 0) lies on both planes. Its position vector 
from the origin is a = 4i — j . We can now write the equation of the line of intersection of Tl l 
and n 2 as 

r = 4i - j + t(i + 1 lj + 7k). 

To find a, I could equally well have chosen to put x = 0 or y = 0 instead of z = 0. I chose 
z because, in this case, it made the equations easier to solve. You can check for yourself that 
if you put x = 0 you get the different point A with a = -45 j - 28k. 

Now we have the equation of the line of intersection of and Yl 2 as r = -45j - 28k 
+ s(i + 1 lj + 7k). Although the equation looks different, it gives the same line as r = 4i - 
j + t(i + 1 lj + 7k). For example, the point with position vector r = 7i + 32j + 21k lies on 
the line. To get it, we put t = 3 or s = 1. 

Since two non-parallel planes intersect in a straight line, we can say that, in general, any 
pair of equations ax + by + cz = d and fx + gy + hz = m will represent a line provided the 
vectors ai + b\ + ck and fi + gj + /zk are not parallel. 

exercise n.c.8 Now try these questions yourself. 

Find whether the following pairs of planes intersect and, if so, the equations of 
their lines of intersection. 

(1) Plane H with the equation 2x - sy + z = 4 and plane IX, with the equation 
x + iy - z = 5 

(2) Plane n 3 with the equation x - jy + 5Z = 4 and plane II 4 with the equation 
-3X + 9/ - 15Z = 8 

(3) Plane n 5 with the equation 4* - y + z = 9 and plane n 6 with the equation 
2x + yy - 7Z = 8 

11. D Finding angles and distances involving lines and planes 

Because vectors carry the physical information of both magnitude and direction, using them 
gives us a powerful method for calculating angles and distances in three dimensions. 

ll.D.(a) Finding the angle between two lines 

Suppose we have two lines with the equations r = a t + 5d x and r = a 2 + td 2 - The angle 
between the two lines is given by the angle between their two direction vectors d t and d 2 . 
I show this in Figure ll.D.l. 
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Figure ll.D.l 



If the lines don't cut each other, the angle between them is defined to be the angle we 
would get by sliding the two direction vectors towards each other until their tails meet 
while keeping their directions unchanged. We can find this angle using the dot product since 
d x • d 2 = | d x 1 1 d 2 1 cos 0 where 0 is the angle between d t and d 2 . 

Here is a numerical example showing how the working out goes. 

Suppose we have the two lines L x and L 2 with these equations. 

r = 2i + 3j - k + s{\ - 2j + 2k) L x 

r = i - 3j + 4k + t(2i - 6j - 3k) L 2 

We find the angle between them by calculating the dot product of their two direction vectors. 
Working in column vectors, we have 

-2 j • ^ -6 j = V(l + 4 + 4) V(4 + 36 + 9) cos 6. 

This gives 2 + 12 - 6 = 8 = 21 cos Q so 6 = cos" 1 ^!) = 67.6° to 1 d.p. 

It may happen that the angle 9 will be obtuse. For example, if we reverse the direction 
of d 2 in Figure ll.D.l, then 0 will be obtuse. In this case, working out n • d will give a 
negative answer and cos -1 0 will be an obtuse angle. We can then find the acute angle 
between the lines by subtracting 0 from 180°. Or we can avoid this problem by using the 
absolute value of d x • d 2 , which we write as |d t • d 2 |, when we find 0. This gives us the 
following general rule. 



The angle 8 between the two lines r = a x + sd t and r = a 2 + td 2 is given by 
|di *d 2 | 



6 = cos 1 



IdJId, 



exercise ii.d.i (l) One pair of lines given below is parallel and one pair is perpendicular. Which 

are they in each case? What is special about the other two pairs? 

(a) r = 2i - j + 4k + s(2i - 3j" + k) and r = i + 6j + k + t(3'\ + 4j + k) 

(b) r = i - 2j + 4k + s(3i - 2] + k) and r = 2i + j + 5k + r(-6i + 4j - 2k) 

(c) r = i + 3j - 2k + s(i - 3j + 4k) and r = -21 - 6j + 4k + f (21 + f\ + k) 

(d) r = 21 + j - 3k + s(s'i - 2] + 3k) and r = 41 + 3j + k + ffei + 6j - k) 
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(2) Find the acute angle between each of the following pairs of lines giving your 
answers correct to 1 d.p. 

(a) r = 2i - 5j + k + s(4i - 7] - 4k) and r = j'\ - j\ + 2k + f^3i - 4j + 12k) 

(b) r = i - j] + 2k + s(3i - j + 4k) and r = 21 + sj + k + f^2i + f\ + k) 

ll.D.(b) Finding the angle between two planes 

It is important to choose the correct angle here. It is defined as the angle between two lines, 
one in each plane, so that they are at right angles to the line of intersection of the two planes, 
like the angle between the tops of the pages of an open book. Figure 1 1 .D.2 shows part of 
two planes and the angle 9 between them. 




Figure 11.D.2 



Now this angle is the same as the angle between the normal vectors to the two planes. If 
these vectors are n t and n 2 then, using the dot product as in the last section, we have 
n x • n 2 = I n x 1 1 n 2 1 cos 9 where 9 is the angle between and n 2 . 

Here is a numerical example of this. 

Suppose we have the two planes TTj and Yl 2 with these equations. 

3x - y - 2z = 7 III 

4x + 2y - 5z = 6 U 2 

We find the angle between them by calculating the dot product of their two normal vectors, 
3i — j — 2k and 4i + 2j - 5k. Working in column vectors, we have 

4 \ 

2 = V(9 +1+4) V(16 + 4 + 25) cos 9. 

This gives 12 - 2 + 10 = 20 = /l4 ^45 cos 9 giving 9 = 37.2° to 1 d.p. 

If one of the normal vectors to the two planes points upwards and one points downwards, 
then the angle between them will be obtuse. For example, in Figure 11. D.2, if n 2 had been 
pointing in the opposite direction, then the angle between rij and n 2 would have been 
(180° - 9) and we would have found the obtuse angle between the two planes. The acute 
angle is then found by subtracting this from 180°. 
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Once again, we can avoid this problem by using li^ • n 2 | rather than n t • n 2 . This gives 
us the following general rule. 



The angle 0 between the two planes r • n t = Q and r • n 2 = C 2 is given by 




EXERCISE 11.D.2 



Find the acute angles between the following pairs of planes giving your answers 
to 1 d.p. 



(1) 3x - sy + z = 4 and 5X + 2y - z = 5 

(2) 4* - sy + 2z = 6 and 2x + 2/ + z = 3 

(3) x - 7y + 3Z = 8 and 4X - y - 5Z = 6 



ll.D.(c) 



Finding the acute angle between a line and a plane 



Again the neatest method is to use a normal vector to the plane. I show how this works in 
Figure 1 1.D.3 with (a) showing the case where n and d are both pointing out from the same 
side of the plane and (b) showing them pointing out from opposite sides. 



Figure 11.D.3 

In both cases, the vector n is normal to plane IT and the vector d gives the direction of 
line L. We slide n until its tail is at the point of intersection of the line L with the plane II 
Then n and L together lie in a plane which is perpendicular to plane II. The angle which the 
line L makes with the plane II is defined to be angle 9. 

To find 0 we first find 0 which is the angle between n and d. We'll consider cases (a) 
and (b) separately. 

(a) Here, we have (p = 90° - 0 and n • d = |n||d| cos (p = |n||d| cos (90° - 0) 
= |n||d| sin 9 using a result from Section 4.A.(h). 

(b) This time, we have (p = 90° + Q. 

This gives us n • d = |n||d| cos (p = |n||d| cos (90° + 6) = |n||d| (- sin 9). 

Since <p is obtuse, working out n • d will also give a negative number, so we have 

matching minus signs on both sides of this equation. 
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It is more convenient to have just one rule which will work for both (a) and (b). We can 
get this by taking the absolute value of n • d which we write as |n • d|. Now |n • d| is positive, 
whatever the sign of n • d and we can now write down this single rule. 



The angle 9 between L and EE is given by 

x /In • d| 



|n • d| = |n||d| sin 9 or 9 = sin 



Here is a numerical example of this. 

Suppose we are asked to find the angle between line L and plane II with these equations 
respectively. 

r = 2i + 3j - k + s(2i - 3j - 5k) L 

3x - 5y + z = 6. II 

We have n = 3i - 5j + k and d = 2i - 3 j - 5k. Using |n • d| = |n 1 1 d| sin 6 and working with 
column vectors, we have 



= V(9 + 25 + 1) V(4 + 9 + 25) sin 9. 

This gives us |6 + 15 - 5| = 16 = ^35 V^S sin 9 giving 9 = 26.0° to 1 d.p. The angle between 
line L and plane IT is 26.0° to 1 d.p. 

If n • d gives a negative answer, and you need to avoid complications in further 
calculations, you could always reverse the direction of either n or d. 




exercise 11.D.3 Find the acute angle between each of the following pairs of lines and planes 

giving your answers correct to l d.p. 

(1) r = 2i - 7j + k + sfei - 2j + k) and 5X - 2y + z = 8 

(2) r = 4i - j + s(i - 7i + 3k) and 2x - 5Z = 3 

(3) r = i + j + 2k + s(s'\ + 7\ + 11k) and 3* - y + 2z = 5 



ll.D.(d) Finding the shortest distance from a point to a line 

If we know the position vector p of a point P relative to the origin O, and the equation of 
a line L, how can we find the shortest distance from P to IP. 

I shall use a numerical example to answer this question since I can then explain both the 
principles and how the working out goes. Suppose we are given the following point P and 
line L. 

OP = p = 2i + j + 5k for point P 

r = i + 6j - 2k + t(3i + j + 4k) for line L 

The shortest distance of P from line L is given by the length of the perpendicular from P to 
L. Suppose this perpendicular meets L at H. Then we want to find the length of PH. I show 
this in Figure 11.D.4. 
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Figure 11.D.4 



Since H lies on L, OH must satisfy the equation of L. Working with column vectors, this 
gives us 



OH=h = 




for some value of t which we need to find. 
Also, PH = TO + ~OH = -p + h so we have 



PH = 




But the vector PH is perpendicular to the direction vector d of line L. So the dot product of 
PH and 3i + j + 4k must be zero. This gives us 



1 ] = 3( 1 3/) ' (5 - I) + 4( 7 f 4/) = 0. 



From this, we get 26? = 26 so t = 1 . Therefore PH = 2i + 6j - 3k. 
The length of PH is given by 




\PH\ = V2 2 + 6 2 + (-3) 2 = 7. 
The shortest distance of the point P from the line L is 7 units. 



exercise ii.D.4 Find the shortest distances between the following pairs of points and lines, giving 

your answers correct to l d.p. if they aren't exact. For each pair, I have let p be the 
position vector of the point P and called the line L 

(1) p = 2i - j + 2k and L is r = -i + 7k + r(zji + j - 2k) 

(2) p = i - j + 5k and L is r = 21 - nj - 8k + f (i + 4j + 3k) 

(3) p = —31 — 2j + 6k and L is r = i + 4k + r(2i + j - k) 

(This one is a bit sneaky and has a fast answer if you can spot it.) 



ll.D.(e) Finding the shortest distance from a point to a plane 

There is a neat way to tackle this problem. I'll explain how this works by taking a numerical 
example and then I'll summarise the method in a general rule. 
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We'll find the shortest distance of the points with OA = a = 3i - 2j + 5k from the plane 
IT with equation 2x + 6y - 3z = 28. This distance is given by the length of AH where AH is 
perpendicular to II and H lies in II To find it, we split the problem into two parts. 

First, we find the perpendicular distance from the origin to the plane. Its equation is 
2x + 6y - 3z = 28 which we can rewrite in the form r • n = 28 with n, the normal vector 
to the plane, given by n = 2i + 6j - 3k. 

We have |n| = ^49 = 7. Therefore n = \(2i + 6j - 3k) gives us a unit normal vector to 
the plane. 

Now we have r*n = h*n = -(28) = 4 so the perpendicular distance of the plane II from 
the origin is 4 units as discussed in Section ll.C.(h). 

Next, we find the distance of the point A from the origin in the direction of n by working 
out n • a. Using column vectors, this is 



n • a = 




We need to keep the negative sign here because it indicates that^4 and II lie on opposite sides 
of the origin. We see that the total perpendicular distance from A to II is 7 units. You can 
see how the distances work out in this particular example in Figure 1 l.D.5. 1 have shown the 
direction of n so that it agrees with the n we found above, with n • h being positive. Notice 
that this means that n • a will give a negative answer since the angle between n and a is 
obtuse. 




Figure 11.D.5 



We can now write down a general rule for finding the shortest distance of a point A with 
position vector a relative to the origin from a plane II with the equation n • r = ax + by + 

cz = C. 
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The shortest distance from A to IT is given by 


In • a-Z)| 

1 1 


n 


C 


where n = — with n — cii + b\ + ck and D ~ 




|n| 


R 



Writing the rule in this way takes care of all the different possibilities for the relative 
positions of A and IT with respect to the origin. In the numerical example we worked out 
above, we have |n • a - D\ = |- 3 - 4| = 7 units. 
Now try these questions yourself. 

exercise 11.D.5 (l) Find the shortest distance of the point A from the plane n in the following two 

cases. 

(a) The equation of n is ix - y + iz = n and OA = a = i + 3j - 3k 

(b) The equation of n is 3* + 12/ - 4Z = 79 and OA = a = 2i + f\ + 3k 
(2) The following question comes from a problem which sometimes faces 

computer graphics programmers when they want to put highlights on surfaces 
to add realism. 

A ray starts from the point A with position vector OA = a = i - 24k. It strikes 
the plane II with equation 2x - 2y + z = 5 at the point M and is then reflected 
through the point B with position vector OB = b = i + 4j + 2k. I show a sketch 
of the ray's path marked with double arrows in Figure 11.D.6. 




Figure 11.D.6 



The angle of incidence of the ray with the plane is equal to the angle of 
reflection so the angles which AM and BM make with the normal vector n are 
equal. 

You want to trace the path of the ray by finding the position vector OM = 
m of the point M and the size of the equal angles which the ray makes with 
the plane II. 

In order to do this, answer the following questions. 

(a) Find the position vector I of L from 0. (Hint: L lies on the line AL which is 
parallel to n and has the known point A lying on it.) 

Use your answer to find the length of AL 

(b) Find the position vectors of S from 0. Use this answertofind the length of BS. 
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(c) From the two triangles ALM and BMS, what can you say about AL/BS and 
LM/MS? Use this, and the ratio theorem (see Example (3) in Section 
n.A.(b) if necessary) to find OM = m. 

(d) Finally, find the angle which the ray AM makes with the plane II. The 
reflected ray MB will, of course, make an equal angle with this plane. 



Skew lines are lines which are not parallel and which do not intersect each other. To find the 
shortest distance between a pair of skew lines, we shall again make use of the equation of 
a plane in the form r • n = D described in Section 1 l.C.(h). In this equation, r is the position 
vector from the origin of any general point in the plane, n is a unit normal vector to the 
plane, and \D\ is the perpendicular distance from the origin to the plane. 

First, I'll describe the general method for finding the shortest distance between two skew 
lines and then I'll take a particular example to show how the working out goes. 

Suppose that the two lines are L x and L 2 with these equations. 



The direction vector of line L x is d 1; and a x is the position vector of a known point A x on 
this line. 

Similarly, the direction vector of line L 2 is d 2 , and a 2 is the position vector of a known 
point A 2 on this line. 

Now, imagine sliding a copy of line L 2 up towards line L x until it intersects with it at^ x . 
It doesn't matter what path you take so long as you keep the direction of L 2 unchanged. Then 
line L x and the shifted line L 2 will lie in a plane, Yi x , and the point A x with position vector 
a x lies on this plane. Also, we can find a normal vector for W x by working out the cross 
product of d t and d 2 since d t and the shifted d 2 both lie in ^ . The normal vector can then 
be adjusted to give a unit normal vector for plane U x which I'll call n. 

We can now write the vector equation of H x in the form r»n = a»n = Z) 1 where \D X \ 
is the perpendicular distance of plane H x from the origin taken as positive regardless of the 
calculated sign of D x . 

Similarly, we can shift a copy of line L x until it intersects with line L 2 atA 2 . The shifted 
line L x and line L 2 lie in a second plane H 2 with the point A 2 lying in it. The planes and 
n 2 are parallel to each other since the n already found is a unit normal vector to n 2 as well. 
This is because d 2 and the shifted d x both lie in IT 2 , so their cross product also gives a 
normal vector to H 2 . So the vector equation of plane IT 2 can be written in the form r • n = 
a 2 • n = D 2 where \D 2 \ is the perpendicular distance of plane IT 2 from the origin. 

The shortest distance between the two skew lines L x and L 2 is now given by - D 2 \. 
We must use D x and D 2 here, including their signs, because this takes care of whether the 
two planes are on the same side or opposite sides of the origin. (We had to do something 
similar when we wrote down the rule at the end of the previous section.) Notice also that 
Z>i = a, • n and D 2 = a 2 • n so \D X - D 2 \ = |(a x - a 2 ) • n|. 

This gives us the following general rule. 



The shortest distance between the two skew lines r = a t + sdx and r = a 2 + fd 2 



ll.D.(f) 



Finding the shortest distance between two skew lines 



r = a t + sd x 
r = a 2 + ?d 2 . 



L 



is given by |(a x - a 2 ) • n| where n = 



d t x d 2 
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Working with vectors 



Here is a numerical example. We shall find the shortest distance between the two skew 
lines L x and L 2 with these equations. 



r = i + 2j + 2k + j(4i + 3j + 2k) 
r = i - 3k + f(4i - 6j - k) 



Li 
L 2 



The point A l with position vector a x = i + 2j + 2k lies on L x . Shifting a copy of L 2 until it 
intersects with L x atA l gives plane O x with the point A x lying in it. 

The point A 2 with position vector a 2 = i - 3k lies on L 2 . Shifting a copy of L x until it 
intersects with L 2 at A 2 gives plane II 2 with the point A 2 lying in it. 

The cross product of the two direction vectors, 4i + 3j + 2k and 4i - 6j - k gives a 
common normal vector n to both planes. Working this out gives 



n = 



l 

4 
4 



k 

2 
-1 



= 9i + 12j - 36k. 



We have |n| = -y/ 1521 = 39 so n = 
normal vector to both planes. 

Now, working with column vectors, we have 



a t • n = n • a t 



3^ (9i + 12j - 36k) = j?j(3i + 4j - 12k) gives a unit 




for IIj 



and 



a 2 • n = n • a 2 




3 = £> 2 for II 2 . 



So the shortest distance between the two skew lines L x and L 2 is given by 

\D X - D 2 \ = \&! • n - a 2 • n| = |- 1 - 3| = 4 units. 

The different signs of D { and D 2 show that these two planes lie on opposite sides of the 
origin. I've shown a sketch of this in Figure 11.D.7. 




I 

4* 



Figure 11.D.7 



11. D Finding angles and distances 
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exercise ii.D.6 Now try these for yourself. 

Find the shortest distance between each of the following pairs of skew lines. 

(1) /.j is given by r = 31 + j + 7k + s(j + 2k) and 
L 2 is given by r = -i + 3j + 2k + t (31 + 3\ + 4k) 

(2) /.j is given by r = 31 + 2) - 5k + s(2i + 4j - 5k) and 
L 2 is given by r = i - 3] - 5k + f (i + k) 

(3) l-i is given by r = 41 - j - 2k + 5(41 + 4j + 3k) and 
L 2 is given by r = -i + 2] + 3k + t(,i^\ - 3] + 3k) 
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Working with vectors 



Answers to the exercises 



Chapter 1 

Exercise 1.A.1 

(1) (a) 8a + b + 3c (\b is written as just b.) (b) 5ab + 5a + 3b (since ab = ba.) 
(c) 5p + 5pq + 8a (d) 2x + 5y + 3xy 

(2) (a) 2 3 = 8 (b) 5a 2 = 5 X 2 2 = 20 (c) (5a) 2 = 10 2 = 100 (d) l 2 = 1 

(e) 8 + 3 = 11 

(3) (a) 6xy (b) I5x 3 y (c) 6a + 9b (d) 6a 2 + Wab (e) 6p 3 + 4/? 2 a + 2/;a 2 

(f) 6x 3 + 4x 3 >> + 2x 2 y 2 

Exercise 1.A.2 

(1) (a) afe + erf = 2 X 3 + 4 X 5 = 6 + 20 = 26 (b) afe 2 e = 2 X 3 2 X 0 = 0! 

(c) ab 2 d = 2X 3 2 X5 = 2X9X5 = 90 (d) (abdf = (2 X 3 X 5) 2 = 30 2 = 900 

(e) a(b + cd) -2(3 + 4X5) - 2(3 + 20) - 46 

(f) a6 2 d + c 3 - 2 X 3 2 X 5 + 4 3 - 90 + 64 = 154 

(g) dl + d-c-2X3 + 5-4-6 + 5-4-7 

(h) a(Z> + d) - c = 2(3 + 5) - 4 = 2 X 8 - 4 = 16 - 4 = 12 

(2) (a) 3x(2x + 3y) + 4>>(x + 7y) = 6x 2 + 9xy + 4xy + 28j 2 = 6x 2 + 13xj + 28j 2 

(b) 5^ 2 (2/> + 3a) + a 2 (3p + 5a) + /?a(p + 2a) = 10p 3 + I5p 2 q + 3^a 2 + 5a 3 + p 2 q + 
2/?a 2 = \0p 3 + 16/? 2 a + 5/>a 2 + 5a 3 

(3) (a) a 2 = 3 2 = 3 X 3 = 9 (b) 3b 2 = 3 X 4 2 = 3 X 16 = 48 (c) (3b) 2 = (12) 2 = 144 
Notice this last pair! Students quite often confuse (b) and (c). In (b), only the b is squared. 

(d) c 2 = l 2 = 1 X 1 = 1 (not 2!) (e) ab + c = 3 X 4 + 1 = 12 + 1 = 13 

(f) bd-ac = 4X5-3X1= 20 - 3 = 17 (g) 6(d - ac) = 4(5 - 3 X 1) = 4(2) = 8 

(h) d 2 - b 2 = 5 2 - 4 2 = 25 - 16 = 9 (i) (d - b) (d + b) = (5 - 4) (5 + 4) = 9 

0') d 2 + b 2 = 5 2 + 4 2 = 25 + 16 = 41 (k) (d + b)(d + b) = (5 + 4)(5 + 4) = 9 X 9 = 81 

(1) a 2 + c 2 d = 3 2 X 4 + l 2 X 5 = 9 X 4 + 1 X 5 = 36 + 5 = 41 

(m) 5e(a 2 -35 2 ) = 0 since it is all multiplied by 0 

(n) a h + d a = 3 4 + 5 3 = 81 + 125 = 206 

(4) (a) 3ailb + 3c) + 2a(b + 5c) = 6ab + 9ac + lab + lOac = 8ab + 19ac 

(b) 2xy(3x 2 + 2xy + y 2 ) = 6x 3 y + 4x 2 j 2 + 2xy 3 

(c) 5p(2p + 3a) + 2q(3p + q) = I0p 2 + \5pq + 6pq + 2q 2 = I0p 2 + 2\pq + 2q 2 

(d) 2c 2 (3c + 2d) + 5a" 2 (2c + d) = 6c 3 + 4c 2 a' + lOcd 2 + 5a" 3 

Exercise 1.A.3 

(1) 2x - (x - 2y) + 5y = 2x - x + 2y + 5y = x + ly 

(2) 4(3a - 2b) - 6(2a - b) = 12a - &b - 12a + 6b = -2b 

(3) 6(2c + d) - 2(3c - d) + 5 = 12c + 6d - 6c + 2d + 5 = 6c + 8d + 5 
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(4) 6a - 2(3a - 5b) - (a + 4b) = 6a - 6a + 106 - a - 4b = - a + 6b 

(5) 3x(2x - 3y + 2z) - 4x(2x + 5y - 3z) = 6x 2 - 9xy + 6xz - 8x 2 - 20xy + Uxz = -2x 2 - 29xy 
+ 18xz 

(6) 2xy(3x -4y)- 5xy(2x -y) = 6x 2 y - 8xy 2 - 10x 2 y + 5xy 2 = - 4x 2 y - 3xy 2 

(7) 2a 2 (3a - 2ab) - 5ab(2a 2 - 4ab) = 6a 3 - 4a 3 b - I0a 3 b + 2§a 2 b 2 = 6a 3 - \4a 3 b + 20a 2 b 2 

(8) -3p - p - q + 2pq - 6q = - 4p - Iq + 2pq 

Exercise 1.A.4 

(1) 5(a + 2b) (2) a(3a + 2b) (3) 3a{a - 2b) (4) x(5y + 8z) 

(5) 5x(y - 2z) (6) ab(a + 3b) (7) 2pq{2q - 3p) (8) x 2 y\3y + 5x) 

(9) 2pq(2p + q - 3pq) (10) a 2 6 2 (26 + 3a - 6) 



Exercise l.B.l 

(1) (x + 2)0 + 3) = x 2 + 2x + 3x + 6 = x 2 + 5x + 6 

(2) (a + 3)(a - 4) = a 2 + 3a - 4a - 12 = a 2 - a - 12 

(3) (x " 2)0 " 3) = x 2 - 2x - 3x + 6 = x 2 - 5x + 6 

(4) (p + 3)(2p + 1) = 2p 2 + +/7 + 3 = 2p 2 + Ip + 3 

(5) (3x - 2)(3x + 2) = 9x 2 - 6x + 6x - 4 = 9x 2 - 4 

(6) (2x - 3y)0 + 2y) = 2x 2 - 3xy + 4xy - 6y 2 = 2x 2 + xy - 6y 2 

(7) (3a - 2b)(2a - 5b) = 6a 2 - 4ab - \5ab + \0b 2 = 6a 2 - 19aZ> + \0b 2 

(8) (3x + 4yf = (3x + 4y)(3x + 4y) = 9x 2 + Uxy + 12x>> + \6y 2 = 9x 2 + 24xy + I6y 2 

(9) (3x - 4y) 2 = (3x - 4y) (3x - 4y) = 9x 2 - 12xj - 12x>> + I6y 2 = 9x 2 - 24xy + 16y 2 

(10) (3x + 4y) (3x - 4y) = 9x 2 + 12x>> - I2xy - \6y 2 = 9x 2 - \6y 2 

(1 1) (2p 2 + 3pq)(5p + 3a) = 10^ 3 + 15^ 2 a + 6p 2 q + 9,Da 2 = I0p 3 + 2\p 2 q + 9^-a 2 

(12) (2ab - b 2 ) (a 2 - 3ab) = 2a 3 b - a 2 b 2 - 6a 2 b 2 + 3ab 3 = 2a 3 b - la 2 b 2 + 3aZ> 3 

(13) (a + b) {a 2 -ab + b 2 ) = a 3 - a 2 b + ab 2 + a 2 b - ab 2 + b 3 = a 3 + b 3 

(14) (a - b) (a 2 + ab + ft 2 ) = a 3 + a 2 6 + ab 2 - a 2 b - ab 2 - b 3 = a 3 - b 3 
These last two often have useful applications so I have separated them out. 

(15) The answer to (b) comes out as 1 less than the answer to (a) each time. 

Using n we have the starting number plus 1 is n + 1 and the starting number minus 1 is 
n- 1. 

These two, multiplied together, give (n + 1) (n - 1) which is n 2 - 1. 
This is 1 less than the starting number squared, and therefore the answer to (b) will 
always be 1 less than the answer to (a). 



Exercise 1.B.2 



(1) (x + 7)0 + 1) (2) (p + 5)(p + 1) (3) (x + 6)(x + 1) (4) (x + 2)(x + 3) 
(5) (y + 3)0 + 3) or (y + 3) 2 (6) (x + 2)0 + 4) (7) (a + 2)(a + 5) 
(8) (x + 4)(x + 5) (9) (x + 4)(x + 9) 



Exercise 1.B.3 

(1) (3x + 5)(x + 1) (2) (2y + 1)0 + 7) (3) (3a + 2)(a + 3) 
(4) (3x + l)(x + 6) (5) (5/7 + 3)0 + 4) (6) (5x + 6)0 + 2) 



Exercise 1.B.4 

(1) (x-3)(x-8) (2) 0-3)0-6) (3) (x-2)(x-9) (4) (p - 3)(p + 8) 
(5) (x-2)(x + 6) (6) (2a+l)(a-3) (7) (3x + 2)(x-4) (8) (2a-5)(a+l) 
(9) (2x + 3)(x - 4) (10) (3b - 2)(b - 6) (1 1) (3x - 5y)(3x + 5y) 
(12) (4x 2 - 9v 2 )(4x 2 + 9y 2 ) = (2x - 3y)(2x + 3y) (4x 2 + 9y 2 ) 
The 4x 2 + 9y 2 won't factorise any further. 
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Answers to the exercises 



Exercise l.C.l 

3 1 5 3 5 b 3y 
(1) - (2) - (3) — (4) - (5) - (6) - (7) — 

4 5 19 8 8 c 2 

2a 3x 6p 5a 

(8) 4p (9) — (10) — (11) — (12) — 

b 2y 5q b 2 

Exercise 1.C.2 

(1) (a) They are all equivalent except for \ and §. 

(b) They are all equivalent. 

(c) The middle one is the odd one out. 

(d) The first two are equivalent because the z cancels, but it isn't possible to cancel the p in 
the last one. 

, s 2{x + x + 3 y 3(7fl " 3b ) 3 , s p( x ~ x ~ a 

(2) (a) = (b) = - (c) = 

2(3x - 4y) 3x - 4y 2(2a - 3b) 2 p(p - x) p - x 

x(2y + 5z) 2y + 5z 2z(2x + 3y) 

(d) No simplification is possible, (e) = (f) = 2z 

6x 6 2x + 3y 

(x - y)(x + y) x-y 
(g) No simplification is possible, (h) = 

(x + y)(x + y) x + y 

(x + 2)(x + 3) x + 3 

(i) = 

(x + 2)(x - 1) x - 1 

Exercise 1.C.3 

(1) 
(2) 

1 2 4 1 X (3 X 5) 2 X (2 X 5) 4 X (2 X 3) 15 20 24 59 

(3) — + — + — = + + = — + — + — = — 

2 3 5 2 X (3 X 5) 3 X (2 X 5) 5 X (2 X 3) 30 30 30 30 

Exercise 1.C.4 

2 7 10 21 31 

(1) - + _ = _ + _ = _ 

9 15 45 45 45 

(Notice, there was a common factor of 3 on the bottom.) 

5 3 5X4 3X3 20 9 29 

(2) - + - = + = _ + _ = _ 

6 8 6 X 4 8 X 3 24 24 24 

1 3 5 1X4 3X3 5X2 4 9 10 23 

(3) — + — + — = + + = — + — + — = — 

3 4 6 3X4 4X3 6X2 12 12 12 12 

(4) There is a common factor of (2x - y) on the bottom. So you should have 

3x 5y 3x 2 5y 2 3x 2 + 5y 2 

+ = + = . 

y(2x - y) x(2x - y) xy(2x - y) xy(2x - y) xy(2x - y) 



3 


2 


3X7 


2X4 


21 


8 


29 




+ — 




+ 




+ — 




4 


7 


4X7 


7X4 


28 


28 


28 


2 


4 


2X5 


4X3 


10 


12 


22 




+ — 




+ 




+ — 




3 


5 


3X5 


5X3 


15 


15 


15 
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(5) There is a common factor of x on the bottom, so we can say 

2 5 2(2x - 1) 5(3x + 1) 
+ = + 



x(3x + 1) x(2x - 1) x(3x + l)(2x - 1) x(2x - l)(3x + 1) 

I9x + 3 



x(2x - l)(3x + 1) 
(6) There is a hidden common factor of (x + y) so we get 
4 3 4 3 



+ 



x 2 - y 2 (x + y) 2 (x - y)(x + y) (x + y)(x + y) 

4(x+y) Xx-y) 



(x - y)(x + y)(x +y) (x + y)(x + y)(x ■ 

4x + Ay + 3x - 3y 
(x - y)(x + y f 

Ix+y 



(x - y)(x + y f 

or you can write this as 

Ix+y 
(x 2 -y 2 )(x+y)' 

Exercise 1.C.5 

I shall put in all the brackets from the start in these answers. 

(3x-5) (2x-3) 3(3x-5) 2(2x - 3) 

(1) + = + 

10 15 3 X 10 2 X 15 

9x-15+4x-6 13x-21 



(2) 



30 30 

(3a + 5b) (a - 3b) (3a + 5b) 2(a - 3b) 
4 2 4 2 X2 

3a + 5b - 2a + 6b a + \ lb 



(3) 



4 4 

(3m - 5m) (3m - In) (3m - 5ri) 3(3m - In) 
6 2 6 3 X2 

3m - 5n - 9m + 21n -6m + 16m 



(4) 



6 6 

2(-3mj + 8m) -3m + 8m 
6 3 

2b 3a 2b 2 3a 2 



a(2a + b) b(2a + b) ab(2a + b) ab(2a + b) 

3a 2 + 2b 2 
ab(2a + b) 
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Answers to the exercises 



(5) 



2a 3ft 2a(a - ft) 3b(a + ft) 

+ = + 

(a + b)(3a + ft) (a- b)(3a + b) (a + b)(3a + b)(a - b) {a- b)(3a + b)(a + b) 

2a 2 - 2ab + 3ab + 3ft 2 2a 2 + ab + 3ft 2 



(6) 



(a + b)(3a + b)(a - ft) (a- b)(3a + b)(a + ft) 
2 



x 2 - y 2 x(x + y) (x - y)(x + y) x(x + y) 

(using the difference of two squares! Did you spot it?) 

5x 2{x -y) 5x-2x + 2y 

x(x - y)(x + y) x(x - y)(x + y) x(x - y)(x + y) 

3x + 2y 3x + 2y 

or, equally nice, 



x(x - y)(x + y) ' x(x 2 - y 2 ) 

Exercise 1.C.6 

(1) The x underneath is a common factor, so 

2 3 A{x + Ay) 3(2x - 3y) 



x(2x - 3y) 2x(x + Ay) 2x(2x - 3y)(x + Ay) 2x(x + Ay)(2x - 3y) 

(Notice, only the x is common, so we multiplied the first 
fraction top and bottom by 2(x + Ay).) 

A(x + Ay) - 3(2x - 3y) 25y - 2x 

2x(2x - 3y)(x + Ay) 2x(2x - 3y){x + Ay) ' 

When the two fractions are combined the top has two chunks which are subtracted. This has 
been tidied up by multiplying out the brackets, and then putting together as much as 
possible. The bottom, with the brackets multiplied is already in a neat factorised form, and 
so we generally leave it this way. 

(2) First, we put in the brackets and get 

(2x-l) (jc-7) 5(2x-l) 3(x-7) 
3 5 5X3 3X5 

10.x - 5 - 3x + 21 7x+16 



15 15 

3a 2 ab 3a 3 b a 3 

(3) (a) X — = = — , dividing top and bottom by 3ft. 

2ft 6c 12ftc Ac 

2a ft 2 2a 9a 2 18a 3 6a 3 

(b) — - = — X = = 

3ft 9a 2 3ft ft 2 3ft 3 ft 3 

3x 2x 2 3x 5vz 2 15xyz 2 15z 

(c) ■*■ = X — = — = 

y 2 z 5yz 2 y 2 z 2x 2 2x 2 y 2 z 2xy 

dividing top and bottom by xyz. 

3x 2 (2x + 3y) y 2 (x - y) 3x 2 y 2 (2x + 3y) (x - y) 3xy(2x + 3y) 

(4) (a) X = = 

2y(x - y) x(x + 3y) 2xy(x - y)(x + 3y) 2(x + 3y) 

It is quicker to do the cancelling before the multiplying (which you can show by 
crossing through the same factors above and below, if you like). 

Then you can leave out the second step of the above working. 

But you must remember that only factors of the whole of the top and bottom can be 
cancelled. 
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^ 5pq(p + q) ^ (3/7 + 2g) _ 5j?(p + q) 
(3p + 2q) q 2 (5p - q) q(5p - q) 

Don't be tempted to cancel the 5s! 

(a 2 - b 2 ) 4 (a 4 - b 4 ) {{a - b) (a + b)) 4 (a 2 - b 2 ) (a 2 + b 2 ) 

(c) X = X 

(a 2 + b 2 ) (a + b) 4 (a 2 + b 2 ) (a + b) 4 

= (a - b) 4 {a 2 - b 2 ) or {a - b f (a + b) if you like 

Exercise l.D.l 

(„ ,2) 16^4 0,-4 ,3, **-&»t 

11 11 

(4) 2T m = = - (5) 4° = 1 (6) 7 1 = 7 (7) 7~ 2 = — = — 

27 i/3 3 I 2 49 

(8) 4 - 1/2 = -^ = 4 = 4 ^ 32U5 = 2 
4 1/2 ±2 2 

(10) 16- 3/4 = — *— = 1— = = — = ±- (11) 25 3/2 = (25 1/2 ) 3 = (±5) 3 = ±125 

7 16 3/4 (16 1/4 ) 3 (±2) 3 ±8 8 

(12) 4r 1/2 = — ^ = — = ± - 
V ' 49 1/2 ±7 7 

Exercise l.F.l 

(1) 10111 = 1 + 1(2 ! ) + 1(2 2 ) + 0(2 3 ) + 1(2 4 ) =1+2 + 4+16 = 23 

(2) 1111 = 1 + 1(2') + 1(2 2 ) + 1(2 3 ) =1+2 + 4 + 8= 15 

(3) 111011 = 1 + 1(2 : ) + 0(2 2 ) + 1(2 3 ) + 1(2 4 ) + 1(2 5 ) = 1 + 2 + 8 + 16 + 32 = 59 



Exercise 1.F.2 
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Exercise 1.F.3 



(1) 


V28 = 


\ 2 X 2 X 7 v2 2 X 7 


= 2v7 


(2) 


4T7 
V45 = 


\3 X 3 X 5 = V3 2 X 5 


<■> IT 
= 3V5 


(31 


V50 


\/2 X 5 X 5 = v2 X 5 2 


= 5V2 


(4) 


.44 = 


\ 2 X 2 X 1 1 x 2' X 


11 = 2V11 


(5) 


v' 63 = 


\ 3 X 3 X 7 x 3 ' X 7 


= 3v7 


(6) 


V40 = 


V2 X 2 X 2 X 5 = v2 2 


X 2 X 5 = 2\''10 



Exercise 1.F.4 

15 5x2 15 5\2 

9-2 7 

14 6x5 7 - 3V5 
4 2 

3 2x3 (3 2x 3) (5 3\2) 15 • 6x6 l()\3 9x2 15 6x6 10x3 9 X 2 

(3) = = = 

5 + 3V2 (5 + 3v2) (5 - 3V2) 25-18 7 

During this working we have used the fact that \/x X \[y = v'xy. An example of this is 
V4 X ^9 = \'36 which is the same as 2 X 3 = 6. 

^^^^ \lx - v is not the same as x.v - \ y. For example, \/9 + 16 = \i25 = 5 but 

»y x 9 + v'T6 = 3 + 4 = 7. 

Chapter 2 

Exercise 2.A.1 

(1) x + 8 = 5 so x = 5 - 8 = -3. (2) 5y = 40 so j = f = 8. 

(3) 2j = 7 so y = 5. 2 

(4) 7 + 2x = 5 - x so 2x + x = 5 - 7 so x = -j. 

(5) 4 + 26 = 56 + 9 so 4 - 9 = 56 - 26 so -5 = 36 and 6 = -f. 

(6) 3(x - 3) = 6 so 3x - 9 = 6 so 3x = 15 and x = 5. 

(7) 3(y -2) = 20-1) so 3>>-6 = 2y-2 so 3>> - 2j = -2 + 6 
so y = 4. 

(8) 2(3a - 1) = 3(4a + 3) so 6a - 2 = 12a + 9 so - 2 - 9 = 12a - 6a 
so - 1 1 = 6a and a = -— . 

(9) 3x- 1 = 2(2x- 1) + 3 so 3x - 1 = 4x - 2 + 3 so - 1 - 1 = 4x - 3x 
so x = —2. 

(10) 20 + 2) = 6/7 - 3(p - 4) so 2/? + 4 = 6p - 3/? + 12 so 4 - 12 = 3/> - 2/? 
and /? = -8. 



(1) 



(2) 



5 5(3 - V'2) 

3 + v2 (3 • x2) (3 \2) 

3 x5 (3 x5)(3 x5) 

3 + v5 (3 + V5) (3 - V5) 



Exercise 2.A.2 

(1) — = 2 so 5x = 3 X 2 = 6 and x = %. 

3 

2x 

(2) 5 + x = — so, multiplying by 3, 15 + 3x = 2x and x = -15. 

3 
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X X 

(3) Multiplying both sides of by 3X4=12 gives 

' x x\ 

12 | =12X1 = 12 so 4x - 3x = 12 and x = 12. 

3 4j 

y (3y - 7) (y - 2) 

(4) Start by putting in the two brackets, so you have — = . 

3 5 6 

Multiply by 30 to get rid of fractions (notice you don't need to use 90). 

Then lOy - 6(3y - 7) = 5(y - 2) so lOy - ISy + 42 = 5y - 10 so 52 = I3y and y = 4. 

(3m - 5) (9 - 2m) 

(5) First, put in brackets to give = 0. 

4 3 

Then multiply by 12 to give 3(3w - 5) - 4(9 - 2m) = 0 (not = 12!) so 9m - 15 - 36 + 8m 
= 0 so 17m = 51 and m = 3. 

x - 1 x -2 

(6) = 1 Putting in brackets and multiplying both sides by 6 gives 

2 3 

/(x-1) (x-2)\ 
6 I — - — - — - — 1 = 6X1=6 so 3(x - 1) - 2(x - 2) = 6 

so 3x - 3 - 2x + 4 = 6 and x = 5. 
p+\ 3 

(7) = — Multiplying both sides by 4(p - 1), and cancelling, gives 

p - 1 4 

4(p + 1) = 3(p - 1) so 4jD + 4 = 3/7 - 3 and /> = -7. 
2 3 

(8) — = Multiplying both sides by y(y +1), and cancelling, gives 

v v • 1 

2(y + 1) = 3y so 2y + 2 = 3j and y = 2. 
4 3 

(9) = Multiplying both sides by (2x + 3) (x - 2), and cancelling, gives 

2x + 3 x - 2 

4(x - 2) = 3(2x + 3) so 4x - 8 = 6x + 9 so -17 = 2x and x = -y. 
2x 3x 

(10) = 1 To get rid of fractions, we must multiply by (x + 2)(x + 5). 

x + 2 x + 5 

Then, cancelling, we get 

2x(x + 5) = 3x(x + 2) - (x + 2) (x + 5). 

(Notice that the -1 has also been multiplied by (x + 2)(x + 5).) 

So 2x 2 + lOx = 3x 2 + 6x - (x 2 + 7x + 10) = 3x 2 + 6x - x 2 - 7x - 10 

10 

so llx = -10 and x = - — . 

11 

2x+l x + 5 3x-l 

3 2 1 

We get rid of the fractions first by multiplying both sides by 3 X 2 X 7 = 42. 
We have 

'2x+ \ x + 5\ (3x- 1 
42 I + = 42 



3 2 \ 1 



2x+l\ x + 5\ (3x-\ 
so 42 I +42 = 42 



Notice that each separate chunk of the equation is getting multiplied by the 42. 
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We then cancel down each fraction in turn to obtain 
14(2* + 1) + 2l(x + 5) = 6(3x - 1) 
28x + 14 + 2lx + 105 = 18x - 6 
28x + 21x- l&c = -14 - 105-6 

125 

so 31jc = — 125 giving x = - . 

31 

x + 3 x - 1 2x—l 

(12) = Putting in brackets and multiplying by 20 gives 

4 5 10 

'x + 3\ /x-l\ (2x-\ 
20 I - 20 = 20 



10 

so 5(x + 3) -4(x - 1) = 2(2x - 1) so 21 = 3x giving x = 7. 
Exercise 2.A.3 

■> s nr 

(1) (a) S = 4jir so r = — and r = \ \ — . 

4ji \ 4ji 

4 , 3F 3 f3K 

(b) V=^Jtr so — = r and r = -»l — • 
4jt \ 4jt 

(2) (a) V=nr 2 h so — - = h. 

nr 

V 7 

(b) V = Ttr A so — = r and r = 

nh 

S- 2jir 2 

(c) S = 2jir + 27tr/z so S - 2nr = 2nrh and = h. 



2nr 



2 2 
V - U 



(3) (a) v = u + 2as so v - u = 2as and = a. 

2s 

(b) f 2 = w 2 + 2<35 so v 2 - 2as = u 2 and u = \V 2 - 2as 

111 111 

(4) — = — + — so — - — = — . 
R _/?| 7?2 R R\ R-2 

Multiplying by R R x R 2 to get rid of the fractions, we get 

RRi 

R 1 R 2 - RR 2 = RR { so R 2 (R 1 - R) = RR t and R 2 



R t -R 



If Ri =3 and R = 2 we have R 2 = 6. We should use a resistance R 2 of 6Q. 



Exercise 2.B.1 

/-3 + 1 2 - 6\ 1-2 + 3 -1 + 4\ 

(1) — — =(-1.-2) (2) — — )-(*§) 



_l+(-4) -5 + (-6) 



5 11, 



(3) ; , ; =H,-T) 



Exercise 2.B.2 

(1) m = -5 (2) y = | x + \ so m = \ (3) _y = — | x + | so m 
(4) J = 4 x + J so m = 4 
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Exercise 2.B.3 



(1) Sketch (c) (2) Sketch (b): y + 4x = 4 so y = -Ax + 4. 

(3) Sketch (a): Ay = x + 4 so y = yc + 1. (4) Sketch (e) (5) Sketch (h) 

(6) Sketch (g) (7) Sketch (d) (8) Sketch (f): y + 2x = -2 so y = -2x - 2 

Notice that appearances can be deceptive. For example, (c), (d) and (h) all look the same 
you take account of the different scales marked on the axes. 
Exercise 2.B.4, 2.B.5 and 2.B.6 all have answers given after Self-test 4. 

Exercise 2.B.7 

/l X (-1) + 2X5 1 X2 + 2X 14\ 

(1) The coordinates are , = (3,10). 

\ 2+1 2+1 / 

(3 X (-2) +1X6 3 X (-3) + 1 X 9\ 

(2) The coordinates are , = (0,0). 

V 1+3 1+3 / 



Exercise 2.C.1 

f 5a -26 = 68 
(1) Multiplying (2) by 2 gives \ 

[ 6a + 2b = 20. 

Adding (1) and (3) gives 11a = 88 so a = 8. 
Substituting in (1) gives 40 - 2b = 68 so b = -14. 
Check in (2): LHS = 24 - 14 = 10 = RHS. 



(2) Multiplying (1) by 5 and (2) by 2 gives 



25p - 10a = 45 
Ap + 10a = -16. 



Adding (3) and (4) gives 29 p = 29 so p = 1. 
Substituting in (1) gives 5 - 2q = 9 so q = -2. 
Check in (2): LHS = 2 - 10 = -8 = RHS. 



(3) First, get rid of fractions by writing 8 X (1) and 3 X (2). 
x - 8j = -20 
9x + y = 39. 

' x-8y = -20 



This gives 



Multiplying (4) by 8 gives 



72x + 8y = 312. 



Adding (3) and (5) gives 73x = 292 so x = 4. 

Substituting in (2) gives 12 + yl3 = 13 so y/3 = 1 and y = 3. 

Check in (1): LHS = \ - 3 = -f = RHS. 

1 1 

(4) Use the same trick which we met earlier of letting — = X and — = Y. 

x y 



Then we have: 



Multiplying (4) by 2 we have 



3X + AY = 0 
2X-2Y= 1. 

3X+AY=Q 
AX-AY= 14. 



Adding these two gives IX = 14 so X = 2. 

3 12 

Substituting in (3) gives 6 + AY = 0 so Y = -5 so x = 5 and j = -5. 
3 4 

Check in (1): LHS = T + — = 6- 6 = 0 = RHS. 

2 ~ 3 
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Exercise 2.D.1 



(1) x 2 + 9x + 14 = (x + 2)(x + 7) = 0 so x = -2 or x = -l. 

(2) x 2 + Ax - 12 = (x - 2)(x + 6) = 0 so x = 2 or x = -6. 

(3) x 2 - 1 lx + 18 = (x - 2)(x - 9) = 0 so x = 2 or x = 9. 

(4) x 2 - x - 20 = (x + 4) (x - 5) = 0 so x = -4 or x = 5. 

(5) 2x 2 + 13x + 6 = (2x + l)(x + 6) = 0 so x = -\ or x = -6. 

(6) 3x 2 - lx - 6 = (3x + 2)(x - 3) = 0 so x = -f or x = 3. 

Exercise 2.D.2 

(1) x = ±3 (2) x = ±\ (3) x - 3 = ±2 so x = + 1 or x = +5. 

(4) 2x-3=±5 so x = 4 or x = -\. 

(5) 3x - 2 = ±6 so x = | or x = -f. 

Exercise 2.D.3 

(1) x 2 + 4x = 21 so (x + 2) 2 - 4 = 21 so (x + 2) 2 = 25 so x + 2 = ±5 
and x = 3 or x = -7. 

(2) x 2 - 6x + 8 = 0 so x 2 - 6x = -8 so (x - 3) 2 - 9 = -8 so (x - 3) 2 = 1 
so x - 3 = ±1 and x = 4 or x = 2. 

(3) x 2 -3x-10 = 0 so x 2 -3x=10 so (x - |) 2 - | = 10 so (x - f ) 2 = f 
so x - J = ± 2 an d x = 5 or x = -2. 

Exercise 2.D.4 

(1) y = x 2 - 4x + 3 = (x - 2) 2 - 4 + 3 = (x - 2) 2 - 1 so the least value of y is -1 which is 
when x = 2, that is, the lowest point on the curve is at (2, -1). 

The v-intercept is (0,3). 

When y = 0, (x - 2) 2 - 1 = 0 so x - 2 = ±1 and x = 3 or 1. 

Therefore the equation x 2 -4x + 3 = 0 has the two roots x = 1 and x = 3, so the curve 
y = x 2 - 4x + 3 cuts the x-axis at (0.1) and (0,3). 

Curve (b) is just curve (a) turned upside down by being reflected in the x-axis, since the 
sign for y is just the opposite way round. 

The sketch for this question is shown beside the one for question (2) below. 

(2) y = x 2 + 2x - 8 = (x + l) 2 - 1 - 8 so the least value of y is -9 when x = -1. The lowest 
point on the curve is at (-1, -9). The v-intercept is (0, -8). When y = 0, (x + l) 2 - 9 = 0 so 
(x + l) 2 = 9 and x + 1 = ±3. The roots of x 2 + 2x - 8 = 0 are x = 2 and x = -4, and the 
curve y = x 2 + 2x - 8 cuts the x-axis at (-4, 0) and (2, 0). Again, curve (b) is just curve (a) 
turned upside down. 




-t 
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Exercise 2.D.5 

-10 ± VlOO-64 -10 ± V36 -10 ± 6 

(1) x = = = = -8 or 

2 2 2 

2 ± v'4 + 32 2 ± 6 

(2) x = = = 4 or -2 

2 2 

-5 ± V25 + 24 -5 ± 7 t 

(3) x = = = - 3 or j 

4 4 



(4) 



-4 ± V16 - 8 4 : xS -4 ± 2V2 



2 2 2 



= -2 + jl or - 2 - V2 = -0.59 or -3.41 to 2 dp. 

1 ± V 1 + 24 1 ± 5 2 
(5) x = = =1 or -3 



1 ± V 1 + 56 1 + V57 1-V57 

(6) x = = or = 2.14 or -1.64 to 2 dp. 

4 4 4 

Exercise 2.D.6 

(1) (a) 2x 2 + 7x + 3 = 0 so (2x + l)(x + 3) = 0 and x = -\ or -3. 

(b) 3x 2 + 4x + 1 = 0 so (3x + l)(x + 1) = 0 and x = -\ or x = -\. 

(c) 2x 2 + x - 4 = 0 gives 'b 2 - 4ac' = l- 4X2X-4 = 33 
so there is no whole number factorisation. 



-1 ± V33 

Using the formula, we have x = =1.186 or -1.686 to 3 d.p. 

4 

(d) 6x 2 - 7x + 2 = 0 so (2x - l)(3x - 2) = 0 and x = \ or x = \. 

(e) x 2 - 5x + 3 = 0 gives 'Z> 2 - 4ac' = 25 - 12 = 13 
so there is no whole number factorisation. 



5 ± VI 3 

Using the formula, we have x = = 4.303 or 0.697 to 3. d.p. 

2 

(f) 6x 2 + 5x - 6 = 0 so (3x - 2)(2x + 3) = 0 and x = § or x = -§. 

(g) x 2 -81=0 so (jc - 9)(jc + 9) = 0 and x = 9 or x = -9. 

Or, you could say, x 2 = 81 so x = ±9. 

(h) 6x 2 - x - 12 = 0 so (3x + 4)(2x - 3) = 0 and x = -f or x = §. 

(i) x 2 - 2 = 0 so x 2 = 2 and x = ±V2 = ±1.414 to 3 .d.p. 

Or, you could say (x + \ 2)(x - \'2) = 0, factorising, which gives the same pair of 
answers as above. 

(j) Factorising x 2 - 5x = 0 we have x(x - 5) = 0 so x = 0 or x = 5. 



Don't be tempted to divide x 2 - 5x = 0 through by x. 

If you do this, you lose the possible answer of x = 0. When x = 0, this division is 
actually impossible because we cannot divide by zero. 



In all the questions above, where I have used factorisation, it is equally acceptable if you 
got your answers by using the formula. 

2x - 3 x - 1 

(2) (a) = - 

2x + 3 x + 1 

Getting rid of fractions by multiplying by (2x + 3)(x + 1), we have 

(2x - 3)(x + 1) = (2x + 3)(x - 1) so 2x 2 - x - 3 = 2x 2 + x - 3 
so 0 = 2x so x = 0. 
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2 1 3 

(b) + = - 

y+\ y-l y 

Getting rid of fractions by multiplying by y(y + l)(y - 1), we have 

2y(y- 1) +y(y + 1) = 3(y + 1)0 - 1) so 2y 2 - 2y + y 2 + y = 3y 2 - 3 

so 7 = 3. 

2x + 4 x - 8 

(c) = 

x + 1 2x - 1 

Getting rid of fractions by multiplying by (x + 1) (2x - 1), we have: 

4x 2 + 6x-4=x 2 -7x-8 so 3x 2 + 13x + 4 = 0 
so (3x + 1) (x + 4) = 0 so x = - | or x = -4. 

Exercise 2.D.7 

The sketches fit to the given equations as follows. 

(1) Sketch (e) (4) Sketch (d) (6) Sketch (c) (7) Sketch (b) 
(8) Sketch (f) (10) Sketch (a) 

The sketches for equations (2), (3), (5) and (9) are shown below. 




Exercise 2.E.1 

(1) y =fix) = 3x 3 + 2x 2 - 3x - 2 

Guessing and substitution show that /(l) = 0 so (x - 1) is a factor, and /(-l) = 0 so (x + 1) 
is a factor. 

Matching up the two sides, we have 

f{x) = 3x 3 + 2x 2 - 3x - 2 = (x - l)(x + l)(3x + 2). 

2 

The roots of f(x) = 0 are 1,-1 and - 3. The y intercept is at (0, -2). 
The coefficient of x 3 is positive, so we have graph 1 below. 
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(2) y =f(x) = 2 + 3x - 3x 2 - 2x 3 

Guessing and substitution show that f(l) and f(—2) = 0 so (x - 1) and (x + 2) are both 
factors. 

Matching up the two sides, we have 

f{x) = 2 + 3x - 3x 2 - 2x 3 = (x - 1) (jc + 2)(-2x - 1) = -(2x + l)(x - l)(x + 2) 

taking out a factor of -1 . 

The roots of /(x) = 0 are - 5, 1 and -2, and the y intercept is at (0, 2). 
The coefficient of x 3 is negative so we have graph 2 on the previous page. 

(3) y =f(x) = 4x 3 - 15x 2 + 12x + 4 

Guessing and substitution give f(2) = 0 so (x - 2) is a factor. There is no obvious second 
root so matching up the two sides, we have 

f(x) = 4x 3 - 15x 2 + 12x + 4 = (x - 2)(4x 2 + px - 2). 

Matching the terms in x 2 gives -15x 2 = -8x 2 + px 2 , so p — -7. 

Checking, using the terms in x, gives 12x = —2px - 2x, so p = -7 is correct. 

Therefore 

f(x) = (x-2) (4x 2 -7x-2) = (x- 2)(x - 2)(4x + 1), 
factorising the second bracket. 

1 

We see that f(x) = 0 has the root x = -4 and the double repeated root of x = 2. Just as we found 
with quadratic equations, this means that the curve of y =f(x) touches the x-axis when x = 2. 
The y intercept is at (0,4) and the coefficient of x 3 is positive, so we get graph 3 below. 




(4) y =f(x) =x 3 - 3x 2 + 3x- 1 

Guessing and substituting shows that /(l) = 0, so (x - 1) is a factor, and there is no obvious 
second root. 

Matching up the two sides gives 

x 3 - 3x 2 + 3x - 1 = (x - l)(x 2 + px + 1). 

Matching up the terms in x 2 gives 

-3x 2 = -x 2 + px 2 , so p = -2. 

Checking, using the terms in x, gives 3x = x - px, so p = -2 is correct. 
Therefore, 

y =f(x) = (x- l)(x 2 - 2x + 1) = (x - l)(x - l) 2 = (x - l) 3 . 

This time, we have a single triply repeated root at x = 1. The y intercept is at (0, - 1). 
The coefficient of x 3 is positive, so we get graph 4 above. 

If you look at this on a graph-sketching calculator, or plot values close to x = 1 for yourself, 
you will see that the curve flattens near x = 1 where the three roots are all bunched together. 
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Exercise 2.E.2 

(1) fix) = x 3 + 2x 2 - 5x - 6 so /(2) = 8 + 8 - 10 - 6 = 0. 

Therefore (x - 2) is a factor and we have x 3 + 2x 2 — 5x — 6 = (x — 2)(x 2 + px + 3). 
Matching the term in x 2 gives 2x 2 = -2x 2 + px 2 , so p = 4. 
Checking, using the term in x gives —5x = -2px + 3x, so p — 4 is correct. 
So fix) = (x — 2)(x 2 + 4x + 3) = (x - 2)(x + l)(x + 3), factorising the second bracket. 



(2) f{x) = 2x 3 - 3x 2 - 8x - 3 so/(3) = 54 - 27 - 24 - 3 = 0, so (x - 3) is a factor. 

We have 2x 3 - 3x 2 - 8x - 3 = (x - 3)(2x 2 + px + 1). 

Matching up the terms in x 2 gives -3x 2 = -6x 2 + px 2 , so p = 3. 

Checking with the term in x gives -8x = —3px + x, so p = 3 is correct. 

So f{x) = (x - 3)(2x 2 + 3x + 1) = (x - 3)(2x + l)(x + 1), factorising the second 
bracket. 



(3) f{x) = 3x 3 + x 2 - 12x - 4 

Testing some values for x, we find that f(2) — 24 + 4 - 24 - 4 = 0, so (x - 2) is a 
factor. We have 3x 3 + x 2 - 12x - 4 = (x - 2)(3x 2 + px + 2). 

Matching up the terms in x 2 gives x 2 = -6x 2 + px 2 , so p = 7. 

Checking, using the term in x, gives -12x = -2px + 2x, so p = 7 is correct. 

So f(x) = (x — 2)(3x 2 + 7x + 2) = (x - 2)(3x + l)(x + 2), factorising the second 
bracket. 

Therefore the solutions of f(x) = 0 are x = 2, x = -5 and x = —2. 



(4) f{x) = 2x 3 + 7x 2 + 2x - 3 

Testing some values, we find /(-l) = -2 + 7 - 2 - 3 = 0, so (x + 1) is a factor. We have 
2x 3 + 7x 2 + 2x - 3 = (x + l)(2x 2 + px - 3). 

Matching up the terms in x 2 gives 7x 2 = 2x 2 + px 2 , so p = 5 

Checking, using the terms in x, gives 2x = px - 3x, so p = 5 is correct. 

So fix) = (x + l)(2x 2 + 5x - 3) = (x + l)(2x - l)(x + 3), factorising the second bracket. 

Therefore the solutions of fix) = 0 are x = -1, x = 1 and x = -3. 



(5) x 4 - 29x 2 + 100 = 0. 

We have a quadratic equation in a beard and dark glasses. 

Putting y = x 2 , we have y 2 - 29y + 100 = 0 so iy - 25)0 - 4) = 0. 

So y = 25 which means that x 2 = 25, so x = ±5, or y = 4 which means that x 2 = 4, 
so x = ±2. 



(6) We have fix) = 5x 3 + ax 2 + bx - 6. (x - 3) is a factor, so /(3) = 0. 
This gives 135 + 9a + 3b - 6 = 0. Therefore 

9a + 36 = - 129 so 3a + b = -43. (1) 

Also, 

/(-2)=-40 so -40 + 4a -26 -6 = -40 so 2a - b = 3. (2) 

(1) added to (2) gives 5a = -40 so a = -8. 

Substituting in (1) gives -24 + b = -43 so b = -19. 

So 5x 3 - 8x 2 - I9x - 6 = (x - 3) (5x 2 + /?x + 2). 

Matching terms in x 2 gives -8x 2 = - 15x 2 + px 2 , so p = 7. 

Checking with the term in x gives -19x = —3px + 2x, so /> = 7 is correct. 

5x 2 + 7x + 2 = (5x + 2)(x + 1) therefore fix) = (x - 3)(5x + 2)(x + 1). 
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(7) The working for the long division is shown below. 



4x t + 4x - 3 

12* - 8x 

Since the division process leaves no remainder, (3x - 2) is a factor of 12x 3 + 4x 2 - 17x + 6. 
Alternatively, substituting x = f gives /(f) = 12(f) 3 + 4(f) 2 - 17(f) + 6 = 0. 
(8) The working for the long division is shown below. 

4x - Z 
2x-\ |€jc*+5x*-$x+I 

-4x+ I 
-4x + 2. 
- I 

1 1 1 ^ 1 9 1 

Alternatively, putting x = j gives /(j) = 6(5) + 5(j) - 8(5) + 1 = -1. 

Chapter 3 

Exercise 3.A.1 

(1) In (a), the volumes are directly proportional to the heights, therefore 

V\ V 2 V A V B V A 4 

— = — so — = — so — . 
h x h 2 4 1 V B 1 

We can see that the volume of A is 4 times the volume of B. 

In (b), the volumes are directly proportional to the (radius) 2 , therefore 

V l v 2 v c V D v c 1 

— = — so — = — so — = — . 
r\ r\ 1 16 V D 16 

Cylinder C has of the volume of cylinder D. 

(2) The kinetic energy is directly proportional to the (speed) 2 , therefore 

E x E 2 E x E 2 E { 25 1 

v\ v\ S ° 25 900 S ° E 2 900 36' 

(3) The volume is directly proportional to the (radius) 3 , therefore 

r\ r\ 8 512 V 2 512 64' 

(4) The time of the swing is directly proportional to the square root of the length, therefore 

T x T 2 Ti T 2 T x i9 3 

4h 4T 2 V9 V25 T 2 sl25 5' 

3 

The time of swing of the first pendulum is 5 of the time of swing of the second pendulum. 
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Exercise 3.B.1 

Here are the functions which you should have found. 

(1) (b) shows f(x) + 2. (c) shows f(x) - 2. 

(2) (b) shows g(x + 2). (c) shows g(x - 2). 

Because we know a — 2, it must be a sideways shift which is happening in these two diagrams. 

(3) (b) shows 2h{x). (c) shows h(2x). 

(4) (b) shows p(x) + 2. (c) shows p(x + 2). 

Exercise 3.B.2 

(1) (a) f(g(x)) =f(2x) = 3(2x) - 5 = 6x - 5 (b) g(f(x)) = g(3x - 5) = 2(3x - 5) = 6x - 10 

(2) (a) f(g(x)) = f(4 - x) = (4 - x) 2 (b) g(f(x)) = g(x 2 ) = 4 - x 2 

(3) (a) f(g(x)) = f(x - 4) = — — , x * 4 (b) g(f(x)) = g(l/x) = 1/x - 4, x * 0 

x - 4 

If you are in doubt about any of these, replace x by 'lump' in the definition of the function to see 
what is happening. Notice that, in (3), we have to exclude the two values of x which would result 
in trying to divide by zero. 

Exercise 3.B.3 

(1) r 1 {x) = \x (2) f- 1 (x)=x + 9 

(3) y = 5x - 9 so y + 9 = 5x and x = f 0 + 9) so f~ 1 (x) = \ (x + 9). 

(4) f{x) is self-inverse so /" 1 (x) = 8 - x. (5) f~ 1 (x) = 4x 

(6) f(x) is self-inverse so f~ 1 (x) = 4/x 

(7) y = 3 - 2x so 2x = 3 - y and x = j (3 - y) so /" 1 (x) = j (3 - x). 

x-3 

(8) Lety = so xy + 2j = x - 3 so 3 + 2y = x - xy = x(l - 7). 

x + 2 

Notice the cunning choice of sides here to avoid lots of minuses. This gives 

3 + 2y 3 + 2x 

x = so f (x) = (x ^ 1). 

1 -y 1 - x 

2x + 3 

(9) Let y = so xy - 2y = 2x + 3 so xy - 2x = 2y + 3 

x - 2 

2y + 3 2x + 3 

so x(y - 2) = 2j + 3 so x = giving / (x) = . 

y — 2 x — 2 

We see that this particular fix) is self-inverse. 
Exercise 3.B.4 

(1) The facts we need for the graph sketch are as follows: 
(a) g(x) = 0 when x = 2. (b) When x = 0, g(x) = -5. 
(c) The value of g(x) is very large and positive if x is just less than -4. 

The value of g(x) is very large and negative if x is just greater than -4. 

x - 2 1 - (2/x) 



(d) g(x) 



x + 4 1 + (4/x) 

y ' 2 4 

so, as x becomes large, the value of g(x) approaches 1, since both — and — 

x x 

become smaller and smaller. 
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The sketch is given in graph 1 below. 

<5>u© 




qfx)= X-Z 
X+4 



The working for the inverse function goes as follows. 



Let y = g(x) 



x + 4 



so xy + Ay = x - 2 so Ay + 2 = x - xy = x(l 



Ay + 2 Ax + 2 

so .v = and g (.v ) 



l-y 

Check: g(4) = ± and ^(j) = 4. 



1 -x 



The facts we need for the graph sketch are as follows: 
(a) h(x) = 0 when x = \. (b) When x = 0, = -5. 

(c) The value of h(x) is very large and positive if x is just less than -1. 

The value of h(x) is very large and negative if x is just greater than -1 . 

(d) As x becomes large, the value of h{x) approaches 2, because 



h(x) = 



2x-5 2- (5/2x) 



x + 1 1 + (1/x) 
The sketch is shown in graph 2 below. 



and both 5/2x and 1/x become very small. 



<+>u(2> 




U(x)- 2x.-S 

DC-*- t 



The working for the inverse function is as follows. 
2x - 5 

Let y = h(x) = so xy + y = 2x - 5 so y + 5 = 2x - xy = x(2 

x + 1 

y+ 5 x + 5 

so x = and h '(x) = 



2-y 



2 - x 



Check: h(3) = \ and hr x (\) = 3. 
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2x + 3 



(3) I show the sketch for f(x) 



x - 2 



below. 




You can see that it is self-inverse because it is symmetrical about the line y = x. 
Exercise 3.C.1 

(1) (a) 4 = 2 2 so log 2 4 = 2. (b) 8 = 2 3 so log 2 8 = 3. (c) 2 = 2 1 so log 2 2 = 1. 

(d) 1 = 2° so log 2 1=0. (e) \ = 2- 1 so log 2 (\) = -1. (f) \ = T 2 so log 2 (±) = - 2. 

(2) (a) 9 = 3 2 so log 3 9 = 2. (b) 81 = 3 4 so log 3 (81) = 4. (c) ^ = 3~ 3 so log 3 (57) = -3. 

(d) \ = 3" 1 so log 3 (|) = -1. (e) 1 = 3° so log 3 1=0. (f) 3 = 3 1 so log 3 3 = 1. 

(g) \ = 3- 2 so log 3 (5) = -2. (h) 27 = 2 3 so log 3 (27) = 3. (i) V3 = 3 1/2 so log 3 (\ 3) = \. 

(3) (a) 100 = 10 2 so log 10 (100) = 2. (b) 1000 = 10 3 so log 10 (1000) = 3. 
(c) 10 = 10 1 so logio (10) = 1- (d) 1 = 10° so logio (1) = 0. 

(e) = so log 10 (jo) = -1. (f) 0.01 = 10~ 2 so log 10 (0.01) = -2. 

Exercise 3.C.2 

(!) (a) log 3 3x = log 3 3 + log 3 x = 1 + l 0 g 3 x. 

(b) log 3 27x 2 = log 3 27 + log 3 x 2 = log 3 3 3 + log 3 x 2 = 3 + 2 log 3 x. 

(c) log 3 (xly) = log 3 x - log 3 y. 

(d) log 3 (x 2 la 2 ) = log 3 x 2 - log 3 a 2 = 2 log 3 x - 2 log 3 a. 

(e) log 3 (ax") = log 3 a + log 3 (x n ) = log 3 a + n log 3 x. 

(f) log 3 (9a x ) = log 3 9 + log 3 (a x ) = log 3 3 2 + x log 3 a = 2 + x log 3 a. 

(g) There is no possible change here. 

(2) (a) logio x + logio (x - 1) = logio (x 2 - x). 

(b) 2 log 10 x - log 10 J = logio (x 2 ) - logio J = logio (x 2 ly). 



(c) log 10 (x + 1) - log 10 (x - 1) = logi 0 




(d) 3 logio x + 2 log 10 y = log 10 (x 3 ) + log 10 (y 2 ) = log 10 (x 3 y 2 ). 
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Chapter 4 

Exercise 4.A.1 

The following are the answers to part (A). 

x 

(1) sin 34° = - so x = 8 sin34° = 4.47 cm to 2 d.p. 



(2) tan 38° = - so y = 5 tan 38° = 3.91 cm to 2 d.p. 

x 

(3) cos 72° = — so x = 15 cos 72° = 4.64 cm to 2 d.p. 

6 6 

(4) tan 54° = - so x = = 4.36 cm to 2 d.p. 

x tan 54° 

3 3 

(5) cos 48° = - so x = = 4.48 cm to 2 d.p. 

x cos 48° 

4 4 

(6) sin 28° = - so y = = 8.52 cm to 2 d.p. 

y sin 28° 

These are the answers to part (B). 

7 5 
(1) tan a = - so a = 60.3° to 1 d.p. (2) sin b = - so b = 38.7° to 1 d.p. 
4 8 

6 8 
(3) cosc = - so c = 48.2° to 1 d.p. (4) sin d = — so d = 53.1° to 1 d.p. 
9 10 

Exercise 4.A.2 

Calling the length of the unknown side x in each case, the answers are as follows: 

(1) x 2 = 4 2 + 7 2 = 16 + 49 = 65 so x= 8.06 to 2 d.p. 

(2) 8 2 = 5 2 + x 2 so x 2 = 64 - 25 = 39 and x = 6.24 to 2 d.p. 

(3) 9 2 = x 2 + 6 2 so x 2 = 81 - 36 = 45 and x = 6.71 to 2 d.p. 

(4) 10 2 = 8 2 + d 2 so d 2 = 100 - 64 = 36 and d = 6. 



Exercise 4.B.1 

(1) First, show the information on a sketch like the one below. 

A 




6 C 

Then ZC = 180° - 78° - 65° = 37°. 

a 5 5 sin 78° 
Also = so a = = 8.13 cm. to 2 d.p. 



And 



sin 78° sin 37° 
b 5 



sin 65° sin 37° 



sin 37° 
5 sin 65° 

so b = = 7.53 cm. to 2 d.p. 

sin 37° 
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(2) First, we draw a sketch, which I've done below. 




Now we can say 

4 6 6 sin 33° 

= so sin ,4= so A = 54.7(8)° = 54.8° to 1 d.p. 

sin 33° sin A 4 

The only problem is that this looks wildly improbable from the sketch above, but the sketch 
does seem to fit the known facts quite well. What has gone wrong? 

In fact, as you may already have realised, the known facts fit two possible triangles. Can 
you draw them both? 



I've drawn sketches for both of them below in (a) and (b). 

(I cheated by only giving one of them in my solution above; you may either have spotted 
the snag, or have sketched (b), or sketched (a) as I did.) 

In (b), I have drawn a dotted line showing where the side AB of (a) would come, so you 
can see how it has been swung round from B to give the other possible position. 




Now you have the right-hand sketch, you can see that the other possible answer for LA 
is 180° - 54.78° = 125.2° to 1 d.p. and this is the value for triangle (a). 

Your calculator will give you identical values for the sin of 54.8° and 125.2°. 

How it is actually possible to have the sin of an angle greater than 90° will be explained 
later, in Section 5.A.(c). 

Next, we find the other measurements for each triangle in turn. 

In AABC (a), LB = 180° - 33° - 125.22° = 21.78° = 21.8° to 1 d.p. 

b 4 

and = so b = 2.73 cm to 2 d.p. 

sin 21.78° sin 33° 

(working with 2 d.p. to avoid rounding errors in the answer). 

In AABC (b), LB = 180° - 33° - 54.78° = 92.22° = 92.2° to 1 d.p. 

b 4 

and = so b = 7.34 cm to 2 d.p. 

sin 92.22° sin 33° 

The two sets of answers now fit the two drawings in believable ways. 

We met just this same situation of ambiguous information giving us two possible 
triangles in case (4) of Section 4.A.(e) on congruent triangles. 
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(3) First we draw a sketch like the one below. 




This time, there is only one possible diagram because if we swing AB around B it only 
cuts AC again the other side of C. Now 

9 5 5 sin 40° 

= so sin A = so A = 20.9(2)° to 1 d.p. 

sin 40° sin ,4 9 

making a note of the second decimal place for use in further calculations. Therefore 
B = 180° - 40° - 20.92° = 119.08° = 119.1° to 1 d.p. 

Assuming from question (2) that it's all right to use sin 119.08° from your calculator to 
find b, you get 

b 9 9 sin 119.08° 

so b = = 12.24 cm to 2 d.p. 



sin 119.08° sin 40° 
Exercise 4.B.2 



sin 40° 



a 2 + b 2 - c 2 64 + 49 - 25 
(1) (a) cos C = = so C= 38.2° to 1 d.p. 



lab 



112 



(b) a 2 = b 2 + c 2 - 2bc cos A = 25 + 64 - 80 cos 72° so a = 8.02 units to 2 d.p. 



b 2 + c 2 -a 2 49 + 9 - 81 
(c) (i) cos A = = so cos A 



(ii) cos B = 



2bc 



42 

9 + 81 -49 41 



23 
42 



and ,4 = 123.2(0)° 



lea 54 54 

(iii) ZC = 180° - 123.20° - 40.60° = 16.2° to 1 d.p. 

(2) (a) Zg = AR = 30°. (b) AQPR = 120°. 

2 2 + 2 2 - (2V3) 2 8-12 1 



so B = 40.6(0)°). 



(c) cos AQPR 



2X2X2 
1 



so cos 120° = - - = - cos 60°. 
2 



(d) 



2 V3 



sin 120° sin 30° 



2 V3 V3 

so sin 120° = = — = sin 60°. 

4 2 

Exercise 4.C.1 

(1) The centre is at (1, -2) and the radius is 4 units. 

(2) x 2 - 2x + y 2 - Ay = 0 = (x - l) 2 - 1 + (y - 2'f - 4 so (x - l) 2 + (y - 2) 2 = 5. 
The centre is at (1,2) and the radius is \ 5 units. 
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(3) x 2 - 8x + y 2 + 7 = 0 = (x - 4) 2 - 16 + y 2 + 7 so (x - 4) 2 + y 2 = 9. 
The centre is at (4,0) and the radius is 3 units. 

(4) x 2 - 6x + y 2 + 2y - 6 = 0 = (x - 3) 2 - 9 + (y + l) 2 - 1 - 6 
so (x- 3) 2 + (y + l) 2 = 16. 

The centre is at (3, - 1) and the radius is 4 units. 

(5) x 2 - x + y 2 + y = 0 = (x - \f - \ + (y + \f - \ = 0 
so (x - j) 2 + (y + j) 2 = \. 

The centre is at {\, - \) and the radius is l/\i2 = \i2/2 units. 

(6) x 2 + 3x + y 2 + 2y + 1 = 0 

so (x + \ f - I + 0 + l) 2 - 1 + 1 = 0 so (x + \ f + 0 + l) 2 = I 

3 3 

The centre is at (- 5, -1) and the radius is j units. 

(7) The circle x 2 + y 2 + 2x - 4y = 0 can be rewritten as (x + l) 2 + (y - 2) 2 = 5 so its centre is 
at the point (-1, 2). 

This is also the centre of the new circle, but the radius of the new circle is 5 units. 
Therefore its equation is (x + l) 2 + (y - 2) 2 = 5 2 or x 2 + 2x + y 2 - 4y = 20. 

(8) The equation of the circle is x 2 - 2ax + y 2 - 2by + c = 0. 

It passes through the point (0,0), therefore putting x = 0 and y = 0 must satisfy its 
equation. Doing this gives us c = 0. 

The point (3,0) also lies on the circle. Putting x = 3 and y = 0 gives 9 - 6a = 0 so a = j- 

The point (0,4) also lies on the circle. Putting x = 0 and y = 4 gives 16 - %b = 0 so 
b = 2. Therefore the equation of the circle is x 2 - 3x + y 2 - 4y = 0. 

Rewriting the equation as (x - | ) 2 + (y - 2) 2 = ^ by completing the squares, gives its 
centre as the point (j, 2) and its radius as j units. 

There is also a neat geometrical way to do this question. 

The sketch below shows the three points A, O and B which the circle must pass through. 
Now, jLAOB = 90° so it is an angle in a semicircle (Section 4.C.(c)) so AB is a diameter of the circle. 
Therefore, the centre C must be the point (j, 2) and the radius must be f units, since the length 
of AB is 5 units by Pythagoras' Theorem. 

This then gives us the same equation as the method using algebra. 




Exercise 4.C.2 

(a) 3y = x - 5 so x = 3y + 5. 

Putting x = 3y + 5 in the equation of the circle gives 

(3y + 5) 2 - 6(3y + 5) + y 2 - 2y + 5 = 0 
so 9>> 2 + 30y + 25 - 18y - 30 + y 2 - 2y + 5 = 0. 

Therefore \0y 2 + lOy = 0 so y 2 + y = 0 so y(y + 1) = 0 so y = 0 or y = -1. 
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^^^^ Remember not to divide through by y in the equation above of y 2 + y - 0. If you do 
^B^H this, you lose the answer of y - 0 (for which this division would have been 
impossible). 



If y = 0 then x = 5 and ify = -l then x = 2 so the line 3j> = x - 5 cuts the given circle at 
the two points (5, 0) and (2, -1). 

(b) Substituting x = 2y - 4 in the equation of the circle gives 

(2y - 4) 2 - 6(2y - 4) + y 2 - 2y + 5 = 0 
so 4y 2 - I6y + 16 - I2y + 24 + y 2 - 2y + 5 = 0 so 5y 2 - 30y + 45 = 0 
so y 2 - by + 9 = 0 so (y - 3) 2 = 0. 

The repeated root of y = 3 shows that the line 2y = x + 4 is a tangent to the circle. When 
j = 3, x = 2 so its point of contact is (2, 3). 

(c) Substituting y = 2x + 3 in the equation of the circle gives 

x 2 - 6x + (2x + 3) 2 - 2(2x + 3) + 5 = 0 

so x 2 - 6x + 4x 2 + 12x + 9 - 4x - 6 + 5 = 0 so 5x 2 + 2x + 8 = 0. 

Putting a = 5 and b = 2 and c = 8 and using the quadratic equation formula gives b 2 - 4ac 
= 2 2 - 160 =-156. 

Therefore this equation has no real roots and the line y = 2x + 3 does not cut this circle 
at all. 

For the sketch, we write x 2 - 6x + y 2 - 2y + 5 = 0 as (x - 3) 2 + (y - l) 2 = 5 so the 
centre of the circle is at the point (3, 1) and its radius is \' 5 units. I show the sketch of this 
circle and the three lines below. 




Exercise 4.C.3 

x 2 + 16x + y 2 - 4y - 101 = 0 can be written as 

(x + 8) 2 -64 + (j-2) 2 -4- 101 =0 or (x + 8) 2 + (y - 2) 2 = 169 

so its centre is at the point (-8, 2) and its radius is 13 units. This makes it possible to draw the 
sketch on the next page. 
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(a) The radius which joins (-8, 2) to (4, -3) has a gradient of 

-3-25 12 
so the gradient of tangent (a) is 



4 -(-8) 12 5 

12 

It passes through (4, -3) so its equation is y + 3 = — (x - 4) 

or 5y + 15 = 12x - 48 or 5y = I2x - 63. 

(b) The radius which joins (-8, 2) to (-3, 14) has a gradient of 

14-2 12 5 

= — so the gradient of tangent (b) is . 

-3 -(-8) 5 12 

5 

It passes through (-3, 14) so its equation is y - 14 = - — (x + 3) 

12 

or I2y - 168 = -5jc - 15 or 12y+5x=153. 

(c) The radius which joins (-8, 2) to (-21, 2) has a gradient of 

2-2 

= 0. 



-21 - (-8) 

From the sketch we see that this radius has zero gradient because it is horizontal. Therefore 
tangent (c) is vertical and has the equation x = —21. 

(d) The radius which joins (-8, 2) to (-8, -11) has a gradient of 
13 

— so it is a fraction which is undefined. 
0 

Looking at the sketch shows us that this radius is vertical, so tangent (d) is horizontal 
and its equation is y = -11. 

Exercise 4.D.1 

The missing measurements are as follows: 

ji 3tt 5ji 4jt 3jt 

30°, 45°, -, 120°, — , — , n, 210°, — , — , 315°, 2ji. 
3 4 6 3 2 
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Exercise 4.D.2 

(1) (a) The arc length = r6 = 5 X ji/6 = 2.62 cm to 2 d.p. 

(b) The area of AAOB = \r 2 sin d = \ X 25 X sin (tt/6) = 6.25 cm 2 . 

^^^^k Remember that you must set your calculator in radian mode before you find the sin 
^B^H of the angle. 

(c) To find the area of the segment, we first find the area of the sector. 
This is \ r 2 8 = \ X 25 X (jr/6) = 6.545 cm 2 . 

So the area of the segment is 6.545 - 6.25 = 0.30 cm 2 to 2 d.p. 

(2) I will follow my own recommendation here and work in radians. If you don't, you must use 
the formula for the area of a sector given in Section 4.D.(d). 

I start by saying that 60° is the same as tt/3 radians. 

The area of the whole circle is Jt(3 2 ) = 28.274 m 2 . 

The area of the minor sector AOB is jr 2 $ = \ x 3 2 X jt/3 = 4.712 m 2 . 

So the area of the shaded part of the circle is 28.274 - 4.712 = 23.56 m 2 to 2 d.p. 

The answer to the thinking point is 90° or nil. 

The reason for this is that the area of the triangle is given by A — y 2 sin 6. The radius r is a 
fixed length, so the maximum area is obtained when sin 8 has its greatest value of one when 8 = 
nil. The largest area the triangle can have is 5 r z . 

Chapter 5 

Exercise 5.B.1 

Here is the sketch of y = cosec x drawn using the graph of y = sin x. 

The two graphs touch each other whenever sin x = ±1, and the graph shows that cosec x becomes 
very large whenever sin x approaches zero. The vertical lines are called asymptotes. The curve 
becomes very close to them near its jumps or discontinuities. 
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Exercise 5.B.2 



Here is the sketch of y = cot x drawn using the graph of y = tan x. I have shown y = tan x with a dashed 
line so that you can see y = cot x more easily. 




It is a kind of mirror image ofy = tan x since it behaves in exactly the opposite way; going off to 
infinity when tan x = 0, and itself equalling zero when tan x goes off towards infinity. (These last two 
properties are always true for reciprocal graphs, which helps when sketching them.) However, if you 
look at the graph ofy = tan x in a mirror, you will see that you also have to slide the mirror image 
to the right (or left!) by nil in order to get the graph of y = cot x. 



Exercise 5.B.3 

Here are the sketches which you should have. 

The original graphs are shown with dashed lines and the reciprocal graphs are shown with 
solid lines. 




Notice what a difference the two zeros for y =f(x) make to the reciprocal graph in (2). 
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In (3), using the rules for sketching reciprocal graphs gives us the graph we already know the shape 
of from Section 3.B.(g). 




In (5), y = \le x = e~\ 

x + 3 x - 2 

In (6), the two graphs cross each other where = . 

x - 2 x + 3 

Therefore (x + 3) 2 = (x - 2) 2 so x 2 + 6x + 9 = x 2 - 4x + 4 so lOx = -5 and x = -\. 
Substituting x = -j in either f(x) or l/f(x) gives y = -1, so the two graphs cut at the point 

(4,-1). 

It's worth comparing the graph for this reciprocal function with the graph which we sketched in 
Section 3.B.(i) of the inverse function of fix). 
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Exercise 5.C.1 



I show below the sketches you should have. Each sketch gives the answers to the individual questions 
for that function. 

In (4), because the curve has been shifted up by one unit, it is no longer odd. It no longer fits onto 
itself if it is rotated through a half turn about the origin. 

In (6), sin (t + jr/3) gets to every value faster by jt/3. The sin curve has been shifted ji/3 to the left. 
We say that the two curves are out of phase by jt/3. 





A*2, T=2tt, cozl, 



A = ), T=TT, f*Mr, €0*2, odd 




A* I, T=4tt, /= y+v, co=£,odd 
© jc= cos t 



AX 




A=l,T=27r ) j^Hn, tort, odd. 
Chapter 5 




A- l y Tr a*, j co=l, neither. 

© x= Cos(fc+%) 




AM, T=2ir, /=>i T/ w=i,odd. 
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Exercise 5.C.2 



Here are the eight sketches which you should have drawn. Each sketch shows the position of P after 
time t and the corresponding length of x which I have drawn using a heavy black line. 1 have also 
shown the starting position of P when t = 0 on each sketch. 





-5 
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Exercise 5.D.1 

(1) \'3 cos t - sin t = R cos (t + a) = R cos t cos a - R sin t sin a 

so \ 3 = R cos a and 1 = R sin a so R = 2 and tan a = l/\ 3 giving a = ji/6. 
Therefore x = V3 cos t - sin f can be written as x = 2 cos (f + jt/6). 

3CA 




We have ^ = 2, co = 1 and T = 2ji. 

(2) 5 cos t + 12 sin t = R cos (r-a) = i? cos ? cos a +i? sin t sin a so 5 =Rcosa and 12 =i? sine? 
so R = 13 and a = tan" 1 (y) = 1.176. 

Therefore x = 5 cos t + 12 sin t can be written as x = 13 cos (f - 1.176). 

A*. 




We have A = 13, co = 1 and T = 2it. 

(3) 15 cos t - 8 sin t = ^ cos (f + a) = R cos f cos a - R sin * sin a 

so 15 = ^ cos a and 8 = R sin a so R = \1 and a = tan -1 (y) = 0.490. 

Therefore x = 15 cos ? - 8 sin ? can be written in the form x = 17 cos (f + 0.490). 




We have v4 = 17, co = 1 and T = 2tt. 
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(4) 2 cos t - 3 sin t = R cos (t + a) = R cos f cos a - R sin ? sin a 

so 2 = R cos a and 3 = R sin a so R= V 13 and a = tan -1 ^) = 0.983. 
Therefore x = 2 cos / - 3 sin f can be written as x = \ 13 cos (t + 0.983). 




We have A = 4l3, w = 1 and T = In. 

(5) cos At - sin At = R cos (At + a)=R cos At cos a - sin 4f sin a so 1 = ^ cos a and 1 = R sin a 
so R = \ 2 and a = tan -1 1 = jv/A. 

Therefore x = cos At - sin 4/ can be written as x = V2 cos (At + ;r/4). 



AX 




notice fcKot one cycle takes 5. 

We have A = 4l, w = A and T = 2nlA = nil. 

(6) v'3 sin 3/ - cos 3t = R sin (3/ - a) = R sin It cos a — R cos 3? sin a 

so \ 3 = R cos a and 1 = R sin a so = 2 and a = tan -1 (1/V3) = jt/6. 
Therefore x = ^3 sin 3t - cos 3/ can be written as x = 2 sin (3t - ji/6). 




notice. tKot one cycle takes ^Lrt/ 3 5. 



We have A = 2, to = 3 and T = 2nl3. 
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Exercise 5.E.1 



This is checked by calculator. 



Exercise 5.E.2 

We have 2 cos 2 x + 3 cos x + 1 = 0 so (2 cos x + l)(cos x + 1) = 0. 

(a) Either cos x = —\ so the principal value is 120°, or cos x = —I so the principal value is 180°. 

(b) The solutions between 0° and 360° are 120° and 360° - 120° = 240°, and 180°. 

(Notice that there are only three solutions here in the given range!) 

(c) The general solution is x = 360° n ± 120° and x = 360° n ± 180°. 

All the answers given by 360°« ± 180° are included if we just write 360° n + 180°. 
If you sketch the graph of y = cos x with the line y = —\, you will see that the line y = —\ is a tangent 
to the curve of cos x and so it is only giving us single points of intersection on each cycle. 



Exercise 5.E.3 

Using tan 2 x + 1 = sec 2 x gives us tan 2 x + 2 tan x - 3 = 0, so (tan x - 1) (tan x + 3) = 0. 

(a) Either tan x = 1 giving the principal value of x = 45°, or tan x = -3 giving the principal 
value of x = -71.57° = -71.6° to 1 d.p. 

(b) The solutions between 0° and 360° are 45°, 180° + 45°, 180° + (-71.57°) and 
360° + (-71.57°) giving 45°, 225°, 108.4° and 288.4° to 1 d.p. 

(c) The general solution to 1 d.p. for all possible angles is given by x = 180°« + 45° and 
x = 180°« + (-71.6°) = 180°« - 71.6°. 

Exercise 5.E.4 

We have cos 2 x + 2 sin x = 1 so (1 - sin 2 x) + 2 sin x = 1 so 0 = sin 2 x - 2 sin x. 



5 



Don't divide through by sin x here, so giving yourself 0 = sin x - 2. If you do this, 
you have ignored the possibility that sin x = 0. 

Instead, we factorise, getting 0 = sin x (sin x - 2) so either sin x = 0, or sin x = 2 
which is impossible. 



(a) sin x = 0 gives a principal value of 0 radians. 

(b) The solutions from 0 to 2ji are 0, n and 2tt. 

(c) The general solution is x = nil radians where n is any whole number. 
Exercise 5.E.5 



(1) 


(a) 


48.2° 


(2) 


(a) 


78.7° 


(3) 


(a) 


2jt/3 


(4) 


(a) 


-jt/4 


(5) 


(a) 


23.6° 



In the following questions, I've used PV to stand for 'principal value'. 
(6) Using sin 2 x + cos 2 x = 1 gives 6(1 - cos 2 x) + 5 cos x = 1 

so 6 cos 2 x - 5 cos x + 1 = 0 so (3 cos x - 1)(2 cos x - 1) = 0. 

(a) cos x = \ so x = 1 .23 (PV) or cos x = \ so x = ji/3 (PV). 

(b) The solutions between 0 and 2ji are 1.23 and 5.05 and jt/3 and 5tt/3. 

(c) The general solution is x = 2nji ± 1.23 and x = 2rm ± jt/3. 
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(7) tan 2 x = tan x so tan 2 x - tan x = 0 so tan x (tan x - 1) = 0. 

Either tan x = 0 giving a PV of 0 so that the general solution is x = rut. 
This gives the solutions 0, Jt or 2ji if 0 < x < 2jt. 

Or tan x = 1 giving a PV of Jt/4 so the general solution is x = nil + jt/4. 
This gives the solutions ti/4 and 5tt/4 if 0 < x < 2jt. 

(8) Using the identity tan 2 x + 1 = sec 2 x we get 2 tan 2 x = 1 so tan x = ±\'' 1/2. 

If tan x = l/v'2, the PV is 35.3° and if tan x = -l/\/2, the PV is -35.3°. 
The general solution is x = 180°« ± 35.3°. (This puts together both the principal values 
which we have found.) 

The solutions between 0° and 360° are 35.3°, 144.7°, 215.3° and 324.7°. 

(9) Using sin 2x = 2 sin x cos x gives 2 sin x cos x - 3 cos x = 0 so cos x (2 sin x - 3) = 0. 

Either sin x = j which has no solution, or cos x = 0 giving a PV of x = nil. 
This gives a general solution of x = 2rm ± jt/2 so x = ji/2 or 3?r/2 if 0 < x < 2jt. 

(10) Using Section 5.D.(h) gives sin 5x + sin x = 2 sin 3x cos 2x = 0. 
So either sin 3x = 0 or cos 2x = 0. 
If sin 3x = 0 the PV of 3x = 0° so the PV of x = 0° 
The general solution is 3x = 1 80«° or x = 60n°. 

The solutions between 0° and 360° are 0°, 60°, 120°, 180°, 240°, 300° and 360°. 
If cos 2x = 0, the PV of 2x = 90° so the PV of x = 45°. 
The general solution is 2x = 360«° ± 90° or x = 180«° ± 45°. 
The solutions between 0° and 360° are 45°, 135°, 225° and 315°. 

Exercise 5.E.6 

(1) Notice that we are working in radians here. 

In Section 5.D. (g) I showed that 3 cos t - 2 sin t = \13 cos(? + a) where a = 0.588 
radians to 3 d.p. 

(a) This has no solutions since we can't have cos t > 1 . 

(b) This equation gives cos (f + a) = 1 so the principal value is t + a = 0. 

This gives the general solution that t + a = 2mi so t = 2mt - a, with a = 0.588. 
If 0 < t < 2jt, we get t = 5.70 to 2 d.p. 

(c) The equation gives V13 cos (t + a) = 1 so cos (t + a) = l/V 13 which gives the principal 
value for (t + a) of 1.290 radians to 3 d.p. 

The general solution for (t + a) is given by t + a = 2rm ± 1.290. 
Putting in a = 0.588, the solutions between 0 and 2ji are given by putting n = 0 and 
n = 1. 

These solutions are t = 0.70 and t = 4.41 to 2 d.p. 
I show all these answers in the sketch below. 




(2) In Section 5.D.(g) we showed that 3 sin 2t + cos 2t = V 10 sin (2t + a) where a = tan 1 \ = 
18.43° to 2 d^ 

We have V 10 sin (2t + a) — 2 so the principal value for (2t + a) is 39.23°. 
The general solution for 2t + a is 180°« + (- 1)" (39.23°), so 
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2t = 180°« + (-1)" (39.23°) - 18.43° 
giving 

t = 90°n + (-1)" (19.62°) - 9.22°. 

Putting n = 0, n = \, n = 2 and n = 3 gives the solutions between 0° and 360° of 
t = 10.4°, 61.2°, 190.4° and 241.2° to 1 d.p. 

Chapter 6 

Exercise 6.B.1 

(1) (a) 2 + 9 + 16 + . . . + 107 

(i) a = 2 and d=l. 

(ii) 107 = a + (n - l)d = 2 + 7 (n - 1), so In = 1 12 and n = 16. 

(iii) S 16 = 16/2 (2 + 107) = 872. 

(b) 100 + 95 + 90 + . . . + 15 

(i) a = 100 and d = -5. 

(ii) 15 = a + (n - \)d = 100 + (n - l)(-5) so 5n = 105 - 15, and n = 18. 

(iii) 5 18 = f (100 + 15) = 1035. 

(c) 6 + 6\ + 6\ + . . . + 17j 

(i) a = 6 and d = 7. 

3 1 3 1 

(ii) 17j = a + (n - \)d = 6 + (« - l)j so 1 14 = j(n - 1), and 47 = n - 1, 
therefore « = 48. 

(iii) 5 48 = 48/2 (6 + 71/4) = 24 x6 + 6x 71 = 570. 

(2) (a) 1 + 2 + 3 + . . . + 100 

a = 1 and d = 1 and n = 100. 

Sum = ir(l + 10 °) = 505 °- 

(b) 2 + 4 + 6 + . . . + 100 

a = 2 and <s? = 2 and « = 50. 
S s0 = f (2 + 100) = 2550. 

(c) The sum of the odd numbers up to 100 is 5050 - 2550 = 2500. 

(d) 1+2 + 3 + 4 + .. .+« 

a = 1 and I = n and the number of terms is n so S„ = \n{\ + n). 
This is an often-used rule and it often appears in formula books. 

(3) a = 1 1 and S 1S = 1269 so we have 1269 = 9(22 + lid). 
This gives 1269 = 198 + 153d so d = 7. 

(4) a = 7 and d = 4. Let 5„ = 1375. We have to find what n is. 
We can say 

1375 = \n{\4 + (n- 1)4) = \n (10 + An). 

Tidying up gives us 

2n 2 + 5n- 1375 = 0. 

-5 ± V25 + 11000 -5 ± 105 

So n = = = 25 or -27.5. 

4 4 

Since the number of terms of a series must be a positive whole number, the answer is 25. 

(5) The third term is twice the first term, so a + 2d = 2a giving d = \a. Also 

10 / (a\\ (4a + 9a 

— \2a + 9 - 
2 \ \2 

This gives 

39 = 13a/2 so a = 78/13 = 6 and d = 3. 
Exercise 6.C.2 

7 25 1 401 11 

(1) 0.7 = — (2) 0.25 = = - (3) 0.401 = (4) 0.011 = 

10 100 4 1000 1000 



S l0 = 195 = — I 2a + 9 ( - 1 1 so 390 = 10 
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(5) Let F = 0.7777 . . . Then 10F = 7.7777 . . . 

7 

Subtracting, we have 9F = 7 so F = 5. 

(6) Let F = 0.292929. . . Then 100F = 29.292929 . . . 
Subtracting, we have 99F =29 so F = %. 

(7) Let F = 2.5343434. . . Then \00F = 253.4343434 . . . 

250.9 2509 

Subtracting, we have 99F = 250.9 so F = = . 

99 990 

(8) If F = 40.2106, then, multiplying by 1000, we have 

10007^= 40 210.6106106106. . . 
F= 40.2106106106... 

40170.4 401704 

Subtracting, we get 999F = 40170.4 so F= = . 

999 9990 



(9) If F = 0.142857, then multiplying by 1 000 000, we have 
1 000 000F = 142 857. 142857 . . . 

F= 0.142857... 

142 857 1 

Subtracting, we get 999 999F = 142 857, so F = = — rather amazingly. 

999 999 7 

12 3 

The digits of the decimal forms of 7, 7, 7, etc. make interesting patterns. You might like to 
look at these for yourself. 

Exercise 6.D.1 

9 11 r 29 1 

(1) 1 r 2 (2) I (3) I 

r=i r=\ r + 1 »•=! r(r + 1) 

(4) We want the terms to alternate in sign, with the odd terms being negative and the even 

terms positive. We can make this happen by multiplying each term by something which flips 
sign in this way. (-l) r will fit our requirements exactly. (This is what we used when we 
wrote down the general solution for a sin in Section 5.E.(d).) So we write 

i (-iy> 2 . 

r= 1 

Exercise 6.D.2 

(1) The first four terms are 5 + 7 + 9 + 1 1 = 32. (An AP!) 

The nth term is In + 3, and the (n + l)th term is 2(« + 1) + 3 = In + 5. 

4 1 

(2) The first four terms are 36 + 12 + 4 + 3 = 533, giving a GP this time. (Remember that 
(5) 0 = 1 from Section l.D.(b).) 

The nth term is 36(f)"- The (n + l)th term is 36(f)". 

(3) The first four terms added are 1 + \ + \ + 55 = 1 fi 

1 1 
The wth term is — and the (n + l)th term is 



n! (n+1)! 
(I gave the meaning of n\ at the end of Section 6A.(a).) 
(4) The first four terms added are + f- | + f- g = -^5. 

The wth term is (— — ] + and the (n + l)th term is ["-— - ) (-1)" 
\n + 2J \« + 3/ 

replacing n by n + 1 in the previous formula. 
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11 1 1 11114 

(5) The first four terms are + + + that is, — + — + — + — = — . 

1X3 3 X 5 5 X 7 7 X 9, 3 15 35 63 9 

1 

The «th term is 



The in + l)th term is 



Exercise 6.D.3 



(2b - 1) (2b + 1) 

1 1 



(2(b + 1) - 1) (2(n + 1) + 1) (2b + 1)(2b + 3) 



(1) X (r - l)(r + 3) = 2 (r 2 + 2r - 3) 

r=\ r=\ 

n n n 

= 2 r 2 + 21 r- Z 3 

r=\ r=\ r=\ 

= \n(n + 1)(2b + l) + 2{\n(n + 1)) - 3b using sums (S 2) and (51) 
= \n{{n + 1)(2b + 1) + 6 (b + 1) - 18) 
= ^b(2« 2 + 3«+l+6« + 6-18) 
= \n(2n 2 + 9n- 11). 

Check: If b = 3 

LHS = I (r - l)(r + 3) = (0)(4) + (1)(5) + (2)(6) = 17. 

r=\ 

Putting b = 3 in the RHS gives g (3)(2 X 3 2 + 9 X 3 - 1 1) = \ (18 + 27 - 1 1) = 17. 

(2) £ r(r - l)(r + 1) = 2 r(r 2 - 1) = £ (r 3 - r) 

= £ r 3 - £ r = \n 2 (n + l) 2 - j«(« + 1) using (S3) and (S 1). 

r=\ r=\ 

Factorising, we get 

\n{n + 1)(b(b + 1) - 2) = \n(n + 1)(« 2 + n - 2). 
Check: If b = 3 

LHS = £ r(r - l)(r + 1) = l(0)(2) + 2(l)(3) + 3(2)(4) = 30. 
Putting b = 3 in the RHS gives j (3)(4)(9 + 3 - 2) = 30. 

These checks are useful not only to be confident that your working is correct, but also 
because they give you good practice in handling series, and seeing the terms building up 
into the sums. 



Exercise 6.E.1 

4 A B 



(1) 



(x + 2)(x + 3) x + 2 x + 3 
so 4 = A(x + 3) + B(x + 2). 

Putting x = -3, we get 4 = -B, so 5 = -4. 

Putting x = -2, we get 4 = A, 

Check with x = 0: LHS = 4 and RHS =12-8=4. 



So 



(x + 2)(x + 3) x + 2 x + 3 
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6 A B 

(2) + 

(2y - l)(2y + 1) 2y - 1 2>> + 1 

so 6 = ^(2j + 1) + B(2y - 1). 

Putting j> = \, we get 6 = 2A, so ^4 = 3. 
Putting y = -j, we get 6 = -21?, so B = -3. 
Check with y = 0: LHS = 6; RHS = 3 + 3 = 6. 

6 3 3 

So 



(2y - l)(2y + 1) 2y - 1 2y + 1 

10 A B C 

(3) + 



x(x - l)(x + 4) x x - 1 x + 4 
so 10 = ^(x - l)(x + 4) + Bx(x + 4) + Cx(x - 1). 

Putting x = 1, we get 10 = 5B, so 5 = 2. 
Putting x = -4, we get 10 = 20C, so C = j- 
Putting x = 0, we get 10 = -4A, so A = — . 

Check with x = 2, say. (We can't use x = 0 as we've used it already.) 
We get LHS = 10 and RHS = (-§)6 + 2(12) + \(2) = 10. 

10 \ 2 \ 

So = -- + + 

x(x - l)(x + 4) x x - 1 x + 4 



When you use partial fractions for integrating, it is usually better to keep A, B and C 
as fractions on top of the original fractions, so 1 shall leave my answers in the form 

\ 1 

rather than . 

x + 4 2(x + 4) 



Exercise 6.E.2 

5 ABC 
= + + 

(x - 2)(x + 3) 2 x - 2 x + 3 (x + 3) 2 

so 5 = A(x + 3) 2 + B(x - 2)(x + 3) + C (x - 2). 

Putting x = -3, we get 5 = C(— 5) so C = -1. 
Putting x = 2, we get 5 = A(5 2 ) so ^4 = 55 = 5. 

Comparing the terms in x 2 , we have 0 = ^4x 2 + Bx 2 so B — —A — —j. 

Checking with x = 0, we get the LHS = 5 and the RHS = \(9) + (-5X-6) + (-l)(-2) = 5. 



(x-2)(x + 3) 2 x-2 x + 3 (x + 3) 2 ' 

2 ABC 
= — + — + 

y 2 (y-i) y y 2 y-i 

so 2= Ay(y - 1) + B(y - 1) + Cy 2 . 

(y 2 works just like any other repeated factor.) 
Putting y = 0, we get 2 = -B so B = -2. 
Putting y = 1 , we get 2 = C. 

Comparing the terms in y 2 , we have 0 = Ay 2 + Cy 2 so A = -C so A = -2. 
Checking with y = 2, we get the LHS = 2 and the RHS = (-2)(2) + (-2)(1) + (2)(4) = 2. 

2 2 2 2 

So — = --- — + . 

y 2 (y-\) y y 2 y-\ 
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Exercise 6.E.3 



14 Ax + B C 

(1) —, = + 

(x 2 + 3)(x + 2) x 2 + 3 x + 2 

so 14 = (Ax + B)(x + 2) + C(x 2 + 3). 

Putting x = -2, we get 14 = 1C so C= 2. 
Putting x = 0, we get 14 = 2B + 3C so IB = 8 and 5 = 4. 
Comparing the terms in x 2 , we have 0 = ^4x 2 + Cx 2 so A — -C = -2. 
Checking with x = 1, the LHS = 14, and the RHS = (-2 + 4)(3) + 2(4) = 14. 

14 (-2x + 4) 2 4-2x 2 

So = + = + . 

(x 2 + 3)(x + 2) x 2 + 3 x + 2 x 2 + 3 x + 2 

The second form looks a bit tidier. 

4 ABy + C 

(2) = - + — 

y(y 2 +i) y y 2 + i 

so 4 = ^(y 2 + 1) + (By + C)y. 
Putting y = 0, we have A=A. 

Comparing the terms in y 2 , we have 0 = Ay 2 + By 2 so S = -A = -4. 

Putting j; = 1, we get 4 = 4(2) + (-4 + C)(l), so4 = 8- 4 + CsoC=0. 

Checking with y = 2, we get the LHS = 4, and the RHS = 4(5) + (-8)(2) = 20 - 16 = 4. 

4 4 4y 



So 



jCf 2 +i) j j 2 + i 



Exercise 6.E.4 

(I haven't filled in all the very straightforward parts of these questions.) 
4 ABC 

(1) 



(2) 



(x + 3)(x - l) 2 x + 3 x - 1 (x - l) 2 

so 4 = ^(x - l) 2 + S(x - l)(x + 3) + C(x + 3). 

Putting x = 1, we get 4 = 4C so C = 1. 
Putting x = -3, we get 4 = 16^ so A = 4. 

Matching the terms in x 2 , we get 0 = ^4x 2 + _6x 2 so B = -A = -4. 
Checking with x = 0 gives the LHS = 4, and the RHS = \-\ (-3) + 3 = 4. 

4 - 1 1 

SO 4 4,* 



(x + 3)(x - l) 2 x + 3 x - 1 (x - l) 2 
3/?+l A B C 



(2p -\)(p + 2) 2 2p-\ p + 2 (p + 2'f 

so 3p + 1 = A(p + 2) 2 + 5(2/? - \)(p + 2) + C(2p - 1). 

Putting p = -2, we get -5 = —5C so C = 1. 

Putting /i = 5, we get 5 + 1 = ^(j) 2 = so ^ = 5- 

Matching the terms in p 2 , we get 0 = Ap 2 + 2Bp 2 so 2B = —A so B = -5. 

Checking with /? = 0 gives the LHS = 1, and the RHS = |(4) - \ (-2) -1 = 1. 

3/7+1 I \ 1 

So 



(2/j - !)(/? + 2) 2 2/?-l /> + 2 (p + 2f 
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4x - 5 4x - 5 A B C 
= = + + 

(2x + l)(x 2 - 6x + 9) (2x+l)(x-3) 2 2x + 1 x-3 (x - 3) 2 

so 4x - 5 = A(x - 3) 2 + B(2x + l)(x - 3) + C(2x + 1). 
Working in a similar way to (2), you should get 

4x - 5 -j 7 1 



(2x + l)(x 2 - 6x + 9) 2x + 1 x-3 (x-3) 2 



lOy A By + C 

+ ■ 



(y - \){y 2 + 9) y - 1 j 2 + 9 
so 10y = A(y 2 + 9) + (Sy + C)(y - 1). 
Putting y = 1, we get 10 = 1(L4 so A = 1. 

Matching the terms in y 2 , we get 0 = Ay 2 + By 2 , so B = —A = — 1. 
Matching the terms in >> we get lOy = Cy - By so C = 10+5 = 9. 
Checking withy = 0, the LHS = 0, and the RHS = +1(9) + (9)(-l) = 0. 

lOy 1 (-J + 9) 1 y-9 
So = + 



(y - l)(y 2 + 9) y - 1 >> 2 + 9 y-l y 2 + 9 

Notice particularly here the rewriting of the second fraction with the minus sign outside, 
using the line of the fraction as a bracket. 

lOx lOx ABC 



(x-l)(x 2 -9) (x - l)(x + 3)(x - 3) x-1 x + 3 

lOx —^ —4 2 

which gives = + + . 

(x - l)(x 2 -9) x-1 x + 3 x-3 

This one is top-heavy, so we rewrite it as 

r 2 + 1 r 2 - 1 + 2 2 2 
= = 1 + = 1 + 



r 2 -l r 2 -l r 2 -\ (r - l)(r + 1) 

2 1 
The partial fractions for come out as 



(r - l)(r +1) r - 1 r + 1 



r 2 + 1 

The final complete answer is — = 1 + 



1 r - 1 r+\ 



This is also top-heavy, so we write it as 

x 4 - 1 + 2 2 
= 1 + . 



x 4 - 1 x 4 - 1 

Did you spot how the x 4 - 1 could be factorised? It uses the difference of two squares 
twice. We can say x 4 - 1 = (x 2 - l)(x 2 + 1) = (x - 1) (x + l)(x 2 + 1). 

2 2 A B Cx + D 

So = = + + 

x 4 - 1 (x - l)(x + l)(x 2 +1) x-1 x + 1 x 2 + 1 

so 2= A(x + l)(x 2 + 1) + B(x - l)(x 2 + 1) + (Cx + D)(x - l)(x + 1). 

Putting x = -1 gives 2 = -4B so B = -\. 

Putting x = 1 gives 2 = 4A so A — 5. 

Matching the terms in x 3 gives 0=v4+,S + CsoC=0. 

Putting x = 0 gives 2 =A - B - D so D = -1. 

Putting x = 2 gives the LHS = 2 and the RHS = \ (3)(5) - 5 (1)(5) - (1)(3) = 2. 

X 4 + 1 i i 1 

The final answer is — = 1 + 



x 4 - 1 x-1 x + 1 x l + 1 
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u 2 -I A B C 

(8) = - + — + 

u 2 (2u +1) u u 2 2u + 1 



(9) 



Students sometimes leave out the first of these fractions, forgetting that u 2 is a 
repeated factor of u times u. 



Now we have, getting rid of fractions, 

u 2 - 1 = Au(2u + 1) + 5(2m + 1) + Cu 2 . 

Putting u = 0 we get -1=5. 

11 l 
Putting u = — 2 we get 4 - 1 = j C so 1 - 4 = C and C = -3. 

Matching the terms in u 2 gives us u 2 = 2Au 2 + Cu 2 so 1 = 2A + C so v4 = 2. 

Putting u = 1 gives the LHS = 0 and the RHS = 2(1) (3) - 1(3) - 3(1) = 0. 

u 2 — 1 2 1 3 

so — = — - — . 

u 2 (2u +1) u u 2u + 1 

x 2 + 1 



(x + 2){x + 4) 

This one is also top-heavy and the rearranging is a bit tricky. You may prefer to use long 
division. If not, it can be rearranged this way: 

x 2 + 1 x 2 + 6x + 8 - 6x - 7 6x + 7 

■= 1 



x 2 + 6x + 8 x 2 + 6x + 8 x 2 + 6x + 8 

Notice that the line of the fraction is acting as a bracket, again. 

6x + 7 6x + 7 -f y 

The partial fractions for = are + . 

x 2 + 6x + 8 (x + 2)(x + 4) x + 2 x + 4 

X + 1 / — J ~2 \ ? ~2 

So = 1 - — — + ^— Ul+ " 



(x + 2)(x + 4) \x + 2 x + 4/ x + 2 x + 4 

Notice the signs! 

" 2 2 2 2 2 

(10) (a) The first four terms of zL are — , — , — , and — . 

r=\ 4r 2 - 1 3 15 35 63 

(b) 4r 2 - I = (2r - l)(2r + 1) (the difference of two squares again!) 

2 2 11 



4r 2 - 1 (2r - l)(2r +1) 2r - 1 2r + 1 
"2 » / 1 1 \ » 1 » 1 

W X — — = 1 - — --- — - = E - — -- E 



1 4r 2 - 1 r=l\2r-l 2r + 1 / r=l2r-l r=l2r+l 

11 1 \ / 1 1 1 1 
= 11+ — + — +... + -— + — +... + + 



3 5 2n - 1 / V 3 5 2« - 1 2« + 1 , 

1 

The second bracket has just been slid along one space, so we get 1 



2n + 1 



1 " 2 

(d) As n -> °o ; > o so X > 1 • 

2n + 1 r=l 4r 2 - 1 

The sum to infinity of this series is 1 . 
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Chapter 7 

Exercise 7.A.2 

(1) The sixth row of Pascal's Triangle is: 

1 6 15 20 15 6 1 (P6) 
so the expansion of (x - 2yf is given by 

x 6 + 6(x 5 )(-2j) 1 + 15(x 4 )(-2j) 2 + 20(x 3 )(-2y) 3 + 15(x 2 )(-2y) 4 + 6(x)(-2y) 5 + (-2yf 
= x 6 - 12x 5 >> + 60x 4 y 1 - 160x 3 >> 3 + 240x 2 y 4 - \92xy 5 + 64y 6 . 

(2) The fifth row of Pascal's Triangle is given by: 

1 5 10 10 5 1 (P5) 
so the expansion of (2x 2 - y 2 ) 5 is 

(2x 2 ) 5 + 5(2x 2 ) 4 (-y 2 ) + 10(2x 2 ) 3 (-y 2 ) 2 + 10(2x 2 ) 2 {-y 2 f + 5(2x 2 ) (-j 2 ) 4 + (- y 2 ) 5 
= 32x 10 - 80 x 8 j 2 + 80x 6 y 4 - 40x 4 y 6 + 10x 2 j 8 - y 10 . 

(3) The fourth row of Pascal's Triangle is: 

1 4 6 4 1 (P4) 

/ l\ 4 
so the expansion of I 2x - — I is 

(2x) 4 + 4(2x) 3 (- -j + 6(2x) 2 (- -j + 4(2x) (- -j + (- - 
8 1 

= 16x 4 - 32x 2 + 24 - — + — . 

x 2 x 4 

(4) The third row of Pascal's Triangle is 

13 3 1 (P3) 

( 3 2 V 
so the expansion of I — + 4x I is 

3\ 3 /3\ 2 /3\ 27 

-1+ 3 1-1 (4x 2 ) + 3 1-1 (4x 2 ) 2 + (4x 2 ) 3 = — + 108 + 144x 3 + 64x 6 . 



It is very easy to make mistakes with complicated terms like we have in these 
questions. It is safest always to put in the working step as I have done, rather than 
trying to do it in your head. 



Exercise 7 A3 

16! 

(1) =1 

16! 0! 

(which you can see must be the case since you are choosing all as). The term is a 16 . 
We define 0! to be equal to 1 to make the formula work in this case. 

(2) = 16. The term is I6a l5 b. 

15! 1! 

16! 16 X 15 

(3) = = 120. The term is 120a 14 Z> 2 . 

14! 2! 2! 



Answers to the exercises 



16! 16 X 15 X 14 X 13 ,„ „ 

(4) = = 1820. The term is 1820a 12 * 4 . 

12! 4! 4! 

(5) = 12870. The term is 12870a 8 * 8 . 

8! 8! 

16! 

(6) = 1820. The term is 1820a 4 * 12 . 

4! 12! 

16! 

(7) =120. The term is 120a 2 * 14 . 

2! 14! 

(8) = 1. The term is * 16 . 

0! 16! 

(9) This works in exactly the same way as the others. 

r is just standing for whichever power of a we might be interested in. 

16! 16! 
We get and the term is 



r!(16-r)! rl (16 - r)\ 

Notice the symmetry of the pairs (1) and (8), (3) and (7), and (4) and (6). This is the 
same symmetry which we saw in Pascal's Triangle. 



Exercise 7.A.4 

(1) The first four terms of (2x -j) 12 . 

Using (Bl), with 'a' = 2x and '*' = -y and n = 12, we get 

12 X 11 12 X 11 X 10 
(2x) 12 + 12(2*)" BO + (2x) 10 B) + — " — (2x) 9 B) 3 

= 4096x 12 - 24576X 11 y + 67584x 1< V - 112640x 9 >- 3 . 

(2) The first four terms of (1 - 2x) 18 . 

Using (B 2) with V = -2x and n = 18, we get 

18 X 17 18 X 17 X 16 

1 + 18(-2x) + (-2x) 2 + (-2x) 3 = 1 - 36x + 612x 2 - 6528x 3 . 

2X1 3X2X1 

(3) The first four terms in the expansion of (1 + x 2 ) 10 . 

Using (B 2) with V = x 2 and n = 10, we get 

10 X 9 10 X 9 X 8 

1 + 10(x 2 ) + (x 2 ) 2 + (x 2 ) 3 = 1 + 10x 2 + 45x 4 + 120x 6 . 

2X1 3X2X1 

(4) The first four terms in the expansion of (j x + 3y) 16 . 
Using (B 1) with 'a' = x/2 and '*' = 3y and n = 16 we get 

x\ 16 (x\ 15 16 X 15 /x\ 14 16 X 15 X 14 /x\ 13 

,2 , _ /o,a3 



+ 16 - (3y) + - (3yY + — ; —{-) (3j) 3 

2/ \2j 2X1 \2/ 3X2X1 

x 16 3 135 945 

+ x 15 y + x 14 y 2 + x 13 y 3 . 

65 536 2048 2048 512 
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Exercise 7.A.5 

9! 10! 

(1) (b) is . (c) is . 

4! 5! 4! 6! 

9! 9! 9! (\ 1 

Then (a) + (b) = + = - + - 

3! 6! 4! 5! 3! 5! \6 4 

9! /4 + 6\ 9! X 10 10! 

= = = = (c)- 

3! 5! \6 X 4/ (3! X 4)(5! X 6) 4! 6! 

This rearrangement is possible because it does not matter which order we multiply the 
numbers in. 

The answer that (a) + (b) = (c) is another way of saying that 84 + 126 in (P 9) gives 210 
in (P 10). 

12! 12! 13! 

(2) (a) is . (b) is . (c) is 



3! 9! 4! 8! 4! 9! 

12! /l 1 

Then (a) + (b) = + = - + - 

' 3! 8! 1 9 4 



12! 


12! 

+ 


3! 9! 


4! 8! 


12! 


f 4 + 9 



12! X 13 13! 

, = = (c). 

3! 8! \9 X 4 J (3! X 4)(8! X 9) 4! 9! 

This answer corresponds to 220 + 495 in (P 12) giving 715 in (P 13). 

(3) This last question looks horrible because it has letters instead of numbers, but it works in 
just the same way as (1) and (2). 

k\ (k+l)] 
(b) is . (c) is 



r\(k-r)\ r\(k+\-r)\ 

k\ k\ 



So (a) + (b) 



(r - 1)! (k - r + 1)! r\(k-r)\ 



k\ ( 1 1\ kl (r + k-r+\ 
+ 



(r- 1)! (k- r)\ \(k - r + 1) r) (r - 1)! (k - r)\ \ r(k - r + 1) / 

k\ X (k + 1) (k + 1)! 

«r- 1)! X r){{k-r)\ X (k - r + 1)) r\ (k - r + \)\ ^' 

k + 1 - r is the same as k - r + 1 . The numbers are just being put together in a different 
order. Having this result will make it very easy for us to prove the binomial theorem by 
induction in Section 7.D.(b). 

Exercise 7.A.6 

(1) (a) The term in x 6 in the expansion of (2 - 3x) n is 
11! 

(2) 5 (-3x) 6 = 10 777 536x 6 . 

5! 6! 

(b) The term in x 6 in the expansion of (2x - y f is 
8! 

6 ,,2 



(2x) b {-yf = 1792x y 

6! 2! 

(c) The term in x 6 in the expansion of (y 2 - 2x 2 ) 10 is 
10! 

2\7 (_1^2\3 = _QAn„14 v 6 



7! 3! 



(y z Y (-2x z Y =-960j 14 x 6 . 
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Answers to the exercises 



(2) (a) The constant term in the expansion of 



2x ■ 



9 



10 



IS 



10! 
5! 5! 



(2x) 5 



= - 1 959 552. 



(b) The constant term in the expansion of 



1 



x + 



IS 



9! 
6! 3! 



84. 



(c) The constant term in the expansion of 



2x 3 + 



1 



16 



IS 



16! 
4! 12! 



(2x 3 ) 4 



12 



= 29 120. 



(3) The term in x 10 in the expansion of (1 + x) 7 (2 - 3x) 5 comes from x 7 X x 3 and x 6 X x 4 
and x 5 X x 5 . So it is: 



[x 7 ] 



5! 



2! 3! 



(2) 2 (-3x) 3 



+ [7x 6 ] 



5! 



1! 4! 



(2) (-3x) 4 



+ 


" 71 5 
X 




2! 5! 



[(-3x) 5 ] 



-1080x 10 + 5670x 10 - 5103x 10 = -513* 



10 



Exercise 7.B.1 

(1) Most people try to spread the numbers much too evenly when they do this. To show you by 
example that such an even spread isn't to be expected, I have written out for you two lists of 
numbers below. The first list gives ten sets of six numbers chosen by using the random 
number generator on my calculator and taking the first two digits, discarding anything over 
49. The second list gives the results of the draws from ten consecutive weeks in a lottery 
similar to the one I described before this exercise. 



List (1) 



6, 


9, 


18, 


29, 


35, 


46 




8, 


28, 


36, 


40, 


41, 


46 


7, 


12, 


28, 


31, 


44, 


46 




5, 


7, 


32, 


34, 


41, 


47 


7, 


18, 


24, 


26, 


27, 


36 




8, 


21, 


25, 


29, 


36, 


43 


13, 


20, 


31, 


32, 


38, 


42 




1, 


11, 


18, 


34, 


46, 


48 


2, 


17, 


22, 


25, 


35, 


46 




7, 


9, 


18, 


19, 


27, 


31 














List (2) 














11, 


25, 


28, 


33, 


34, 


47 




5, 


8, 


23, 


24, 


28, 


48 


16, 


18, 


21, 


27, 


38, 


41 




1, 


15, 


22, 


28, 


40, 


49 


2, 


12, 


20, 


22, 


41, 


45 




2, 


10, 


14, 


25, 


37, 


41 


5, 


10, 


19, 


24, 


34, 


46 




10, 


11, 


29, 


32, 


33, 


40 


10, 


22, 


28, 


30, 


36, 


37 




4, 


5, 


9, 


25, 


30, 


47 



You can see from these lists that two consecutive numbers have come up surprisingly 
often, in fact ten times altogether. Also, runs of the same number occur, with 10 coming up 
in four consecutive lottery draws, for example. There are also corridors with no numbers in. 
For example, there are no 42, 43 or 44 in the ten consecutive lottery draws. You can see the 
uneven spread of the numbers even more clearly if you show these lists on squared paper 
also. 

(2) Exactly the same rules of probability apply to each set of numbers, so each of them have an 
equal chance of 

1 

13983 816 
of being the correct choice. 

(3) You would be wiser to pick the third choice. The reason for this is that many people think 
that 'random' means more or less evenly spread. This is not so. 'Random' means just that; 
the choices can happen anywhere, and therefore there will sometimes be bunching, and 
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sometimes large gaps, as you can see in the two lists above. The behaviour of such random 
distributions of numbers is explained mathematically in courses on statistics. These 
mathematical descriptions can then be used to answer questions such as how much concern 
should be felt about clusters of cases of particular illnesses such as childhood leukaemia; is 
there a local cause or are such clusters to be expected anyway? 

Since most people would probably think that 44, 45, 46, 47, 48, 49, would be very 
unlikely, if these numbers did come up, you would be less likely to have to share your 
winnings! The choice of 1, 2, 3, 4, 5, 6 would be less satisfactory since many people have 
favourite 'lucky' numbers which they choose, and such numbers are usually small. Other 
people are likely to choose birthdays, so they will be restricted to 3 1 or less. It is also likely 
that many people would be very doubtful about choosing even two numbers together, never 
mind the whole six. In fact, in a situation like this, the probability of getting at least one 
pair of consecutive numbers is about one half; much higher than you would probably guess. 

(4) On any particular draw of this lottery, there will be 43 numbers which are not chosen. The 
number of different possibilities if six numbers are chosen from these 43 numbers and the 
order of choice doesn't matter, is given by 

43! 

= 6 096 454. 

37! 6! 

We know already that the total number of ways of picking 6 numbers from 49 numbers 
is 13 983 816. 

This means that there are 13 983 816 - 6 096 454 = 7 887 362 ways of choosing at least 
one number correctly. Therefore the probability of doing this is 

7 887 362 

= 0.564 to 3 d.p. 

13 983 816 

We see that you should expect to have at least one number correct more often than not, 
which makes it not quite as encouraging for future prize-winning as it might seem. 

Exercise 7.C.1 

(-3)(-4) „ (-3)(-4)(-5) 

(1) (1 + 2x)~ 3 = 1 + (-3)(2x) + (2x) 2 + (2x) 3 . . . 

2X1 3X2X1 

= 1 - 6x + 24x 2 - 80x 3 . . . 
The expansion is valid if \2x\ < 1 or -\ < x < + \. 

(2) (l - 3x) _1 = 1 + (-l)(-3x) + (-3x) 2 + (-3x) 3 . . . 

2X1 3X2X1 

= 1 + 3x + 9x 2 + 27x 3 + . . . 

This is just the same as the first expansion in this section except that x has been replaced by 
-3x, so it tidies up particularly nicely. 



It's worth remembering that 

(1 - 4)- 1 = 1 + 4 + * 2 + 4 3 +. . . and (1 + = 1 - 4 + 4 2 - 4 3 + 4 4 - 
where 4 stands for whatever we have in this position in the bracket, and |4| < 1 . 



Here, 4 = 3x so we must have |3x| < 1 or -\ < x < + f for the expansion to be valid. 

™ „ 1 N 2 i , , ( - 2)( -^ A ^2 (-2X- 3 X- 4 ) A ,3 

(3) (1 + 3 x)- 2 = 1 + (-2)(5X) + — — — (5 x) 2 + (5 x) 3 +. . . 

Z X 1 3XZX1 

- 1 2 4-1 2 ± 3 4. 
1 3 3 27 * ' * 

For the expansion to be valid, we must have \^x\ < 1 or -3 < x < + 3. 
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Answers to the exercises 



Exercise 7.C.2 



(1) (a) Here, * = -3x. For the expansion to be valid, we must have |4| < 1, so |3x| < 1, so 
|x| < 3 or -3 < x < 3. (Remember that |-3x| = |3x|.) Then: 

(i - 3x)" 3 =i+d) ( -3x) + ( ^ )( ?, )( " 3x)2 + foi*^ (-3x) 3 +. 



2 X 1 



3X2X1 



1 - x - x 



(b) Here, * = — , so we must have 
2 



2 5 3 



1 + ■ 



2/3 12 
= 1 + 

' 3 



x , giving the first four terms. 
< 1 that is |x| < 2 or -2 < x < 2. Then: 

(1x4) M 2 + (1x4x4) /* 

2X12 3 X 2 X 1 \2 



= 1 H — - — + , giving the first four terms. 

3 36 162 

(c) This time, we must rearrange to get the (1 + *) form. 



(16 - 3x) ! 



= 2 



16 



1/4 



3xV 



= 21 



3x 
16 



because 16 = 2. 



l + (i) 



3x 
16 

3x 27x 2 



(i)(4) / 3x\ 2 (|X4)(4) 



2 X 1 
189x 3 



16 



3X2X1 



3x N 
16, 



64 8192 524288 



3x 27x 2 



189x 3 



32 4096 262144 



, giving the first four terms. 



Here, * = 



3x 



so we must have 



3x 



16 



16 16 

< 1 so -16 < 3x < 16 so - — < x — . 

3 3 



(d) Rearranging again, we have 



(4 + x)- 



4 1 + 



-1/2 



1 + 



-1/2 



1 

4 1/2 
1 

2 
1 
2 
1 



1 + 



1 + H) 



1 + 



-1/2 



X 

1 -- + 



X 

4 

3x 2 



(4x4) (x 



2 X 1 

5x 3 



K, 3 



H)(-f)H) I* 

3X2X1 



8 128 1024 

2 5x 3 



x 3x 
— + 



Here, * = — , so we require 
4 



2 16 256 2048 

< 1, so -4 < x < 4. 



, giving the first four terms. 



(e) Again, rearranging is necessary. 



(-2 + x)- 2 = 



-2 1 



-2 



(-2X 



-2 
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We must take out -2 to make the bracket start with +1. Taking out +2 will not do 
what we want. 



(-2) 2 



(-2)(-3)(-4) 



1 + ( . 2) ,.i +<^> _* . 

2/ 2X1 2 3X2X1 



3x 2 x J 

1 + x + + — + 

4 2 

,2 





3 


H) 





1 x 3x x J 

= + — + — + + — , giving the first four terms. 

4 4 16 8 



. x 

Here, * = - — , so we require 

2 2 

(f) Rearranging yet again, we have 
(27 - 4x)- 2/3 = 2T m | I 



< 1 so -1 < - < 1 so -2 < x < 2. 
2 



4x^ 2 5 
21, 



2T 



2/3 



4x^ 2 5 
27 , 



The first four terms are given by 



4x\ | (-f)(-f) / 4xV | (-fX-|)(-f) / 4xV 
27/ 2X1 I 27/ 3X2X1 \ 27/ 



(NB 27 2/3 
1 
9 
1 



i + (-!) 

(27i«)2 = 32 = 9 See Sect i on i.D.(b) if you need help with this.) 



8x 80x 2 2560x 3 
1 + — + + + 



81 6561 



1594323 
2560x 3 



8x 80x 2 

- + + + . 

9 729 59 049 14 348 907 



4x 

This time, 4 = , so we require 

27 



< 1 SO - -J < X < — . 



( 2 ) Method (l) 



(3 - 2xX' (1 + 3X)- 1 = | (1 - | x)- 1 (1 + 3xX ! 

= \ (1 + f x + 5 x 2 + . . 0(1 - 3x + 9x 2 - . . .) 

Now we take just the multiplications giving the terms up to x 2 , giving 

\ [1 - 3x + 9x 2 
+ fx - 2x 2 
+ fx 2 ] 



2 + fl2x = I 2 ,62 2 

3 X ~r ^ X y ^ Q X < 27 X . 



Method (2) Writing 



2_ 

11 



3 - 2x 1 + 3x 



in partial fractions, we get 

(3 - 2x0(1 + 3jc) 

= H (3 - 2x)- ! + £(1 + 3x)-' 



11 



= TT [3- 1 (l-fx^nG +3*)" 1 

= ^ [1 + 2x/3 + (2x/3) 2 + . . .] + it [1 - 3x + (3x) 



(tidied up) 



67 j 

x + 27 x giving just the first three terms. 
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Answers to the exercises 



In order for the first expansion to be valid, we require that 



< 1 so- 



For the second expansion to be valid, we must have \3x\ < 1 so - 3 < x < 3. 

Therefore, in order to fit both requirements simultaneously, we must take 
the tighter of the two restrictions which is that - 3 < x < 3. 



Exercise 7.D.1 

(1) We want to show that 1 + 2 + 3 + 4 + . . . + n = \n(n + 1). 

First, check that the statement is true when n — 1 . 

LHS = 1. RHS = \ X 1 X 2 = 1, so, yes, St[l] is true. 

Then, suppose that the statement is true for n = a particular value, k, so 

1 +2 + 3 + . . . + k = \k(k + 1). St[&] 

Now we must show that this means that it is also true for n = k + 1 . That is, we must 
show that 

1 + 2 + 3 + . . . + k + (k + 1) = \(k + l)(k + 2). St[£ + 1] 

(The extra term has been added on to the LHS and k has been replaced by k + 1 in the 
formula on the RHS.) 

Now, the whole bunch of 1 + 2 + 3 + . . . + k on the LHS of St[k + 1] can be replaced 
by -jk(k +1), using St[£]. So we have that 

the LHS of St[k + 1] = \k(k + 1) + (k + 1). 

But, factorising this, we have 

\ k(k + 1) + (k + 1) = \(k + l)(k + 2) = the RHS of St[k + 1]. 

We have taken out the factor of (k + 1), and also the 5, because we know we want it at the 
front. This means that we must have the 2 in the second bracket to make the whole thing 
multiply out correctly. 

So, if the statement is true when n = k, then it is also true when n = k+ 1. 

But it is true when n = 1, so therefore it is true when n = 2, and so on through all the 
counting numbers. 

(2) (a) S 1 = 1 = l 2 (b) S 2 = 4 = 2 2 (c) S 3 = 9 = 3 2 (d) S 4 = 16 = 4 2 

So it looks as if the sum of n odd numbers is n 2 . 

Suppose this statement is true when n = a particular value, k. Then 

1 + 3 + 5 + . . . + (2k - 1) = k 2 . St[k] 

Nasty pitfall. The «th odd number isn't n, but 2n - 1 . (For instance, the third odd number 
isn't 3 but 5.) 

We must now show that if St[£] is true, then St[& + 1] is also true, so 

1 + 3 + 5 + . . . + (2k - 1) + (2k + 1) = (k + l) 2 . St[k + 1] 

Using St[k], the LHS of St[k + 1] = k 2 + (2k + 1) = (k + l) 2 = RHS of St[k + 1]. 

But the statement is true when n = 1, so therefore it is true through all the counting 
numbers. 

(3) We have to show that 

1 X2 + 2X3 + 3X4 + ... + «(« + l) = \n(n + l)(n + 2). 

First, we check that this formula is true when n = 1 . 

The LHS is 1 X 2 = 2 and the RHS is 1/3(1)(2)(3) = 2, so St[l] is true. 
Then we suppose the statement is true for n = a particular value, k, so that 

1 X 2 + 2 X 3 + 3 X 4 + . . . + k(k + 1) = \k(k + \)(k + 2). St[£] 
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We must then show that St[k + 1] is true, that is, that 

1 X2 + 2X3 + 3X4 + ... + ^+l) + (H l)(k + 2) = j(k + l)(k + 2)(k + 3), 

adding the extra term to the LHS and putting n = k + 1 on the RHS. 
Using St[/fc], the LHS of St[£ + 1] is 

\k(k + \)(k + 2) + (k + l)(k + 2). 

Now, we factorise by taking out \ (k + l)(k + 2). We get 

j*(jfc + IX* + 2)(k + 3) 

which is the RHS of St[k + 1]. (Check by multiplying out this last bracket if you are unsure 
about this step.) 

So, if St[k] is true, then St[k + 1] is true. 

But St[l] is true, and so St[2] is true, and so on through all the counting numbers. 

Chapter 8 

Exercise 8.A.1 

(1) dyldx = I4x + 12x 3 (2) dxldt = 5-\t 2 

(3) y = 3 - 2x~ 3 so dy/dx = 0 + 6x~ 4 = 6/x 4 . 

(4) dxldt = 2 x \ x r 1/2 - 3 x \ x r 3/2 = r 1/2 - |r 3/2 

(5) (a) If x is increased by a small amount dx, so that y gets increased by a correspondingly 

small amount by, we have 

y + dy = (x + dx? = x 3 + 3x 2 (<5x) + 3x0x) 2 + (<5x) 3 
But y = x 3 

so <5y = 3x 2 ((3x) + 3x((5x) 2 + ((5x) 3 so dy/dx = 3x 2 + 3x(<5x) + (dx) 2 . 

Now, when we let dx —> 0, we can ignore the terms 3x(dx) and (<5x) 2 , so the limit of 

dy/dx as dx — > 0 is cfy/<ix = 3x 2 . 

(b) Here, we are taking the particular case of x = 2 and dx = 0.001 so we have 

y + dy = (x + dxf = (2 + 0.001) 3 = 2 3 + 3(2 2 ) (0.001) +3(2)(0.001) 2 + (0.001) 3 

But y = 2 3 so 

dy = 3(2) 2 (0.001) + 3(2)(0.001) 2 + (0.001) 3 

and since dx = 0.001, we have 

dy/dx = 3(2 2 ) + 3(2)(0.001) + (0.001) 2 = 12.006001. 

If we make dx even smaller, dy/dx will approach even more closely to the value of 
3(2 2 ) = 12 given by dyldx = 3x 2 when x = 2. 

Exercise 8.A.2 

(1) We have y = sin / so dyldt = cos t and d 2 y/dx 2 = - sin t = -y 

so we do get the same kind of link between the acceleration and the distance of Y from O. 
The point Y is also moving in SHM, but it is starting from the central position on its path 
whereas X started from its most extreme positive position. 

(When t = 0 we have y = 0 too, but if x = cos t, then x = 1 when t = 0.) 

(2) For this new object, x = 3 cos t + 4 sin t. 

dxldt = — 3 sin t + 4 cos t and d 2 yldt 2 = -3 cos t -4 sin t = -x. 

So this object is also moving in SHM. This equation describes a motion which starts from 
an intermediate point when the time t = 0. 

We know from Section 5.D.(f) that if x = 3 cos t + 4 sin t then the two most extreme 
values of x are x = +5 and x = -5. When ? = 0, x = 3. 
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Exercise 8.C.1 



Here are the answers you should have. To help you, I have shown the two separate bits in square 
brackets, before tidying up, and also put what I chose for X at the side of each answer. 

(1) dyldx = [4(2x 2 + 3) 3 ] [4x] = 16x(2x 2 + 3) 3 with X = 2x 2 + 3. 

(2) dxldt = [5{t 3 + 2) 4 ] [3t 2 ] = \5t 2 {t 3 + If with X= t 3 + 2. 

(3) dyldx = [4(3x 2 - 2x) 3 ] [6x - 2] = 8(3x - 1) (3x 2 - 2x) 3 with X = 3x 2 - 2x. 

(4) dxldt = [\{3t + 4)- 1/2 ] [3] = | (3f + 4)- 1/2 with X = 3/ + 4. 

(5) dyldx = [3e 4x ] [4] = 12e 4x with X= 4x. 

(6) dxldt = [e' 2+l ] [2t] = 2te' 2+l with X=t 2 +\. 

(7) dyldx = [2e x2+x ] [2x + 1] = 2(2x + l)e x2 + x with X = x 2 + x. 

(8) cfc/A = [-sin (4/ + jr/3)] [4] = -4sin {At + n/3) with X = At + jt/3. 

(9) cfe/A = cos / + [cos 2/] [2] = cos t + 2 cos It with X = 2t. 

(10) cfy/Jx = [cos (x 2 )] [2x] = 2x cos (x 2 ) with X=x 2 . 

(11) dyldx = [2 sin x] [cos x] = 2 sin x cos x with X = sin x. 

(12) cfy/Jx = [3 cos 2 x] [- sin x] = -3 sin x cos 2 x with X = cos x. 



(13) 



dx 



1 

4x 



[4] = 



4x 



with X = Ax. 



Students often give the answer of — for this one! 

4x 



(14) 
(15) 



dy 
dx 

dx 
dt 



1 



3x + 1 
1 

2t 2 + 1 



[3] = 
[At\- 



3x+ 1 
At 



with X=3x+l. 



2t 2 + 1 



with X=2t 2 +1. 



(16) dyldx = [-sin (5x 2 + Jt)] [lOx] = -lOx sin (5x 2 + n) with X = 5x 2 + jr. 

(17) dxldt = [cos {2t 2 + 3)] [4?] = At cos (2? 2 + 3) with X = 2t 2 + 3. 



(18) 



1 



sin x 



cos x 

[cos x] = = cot x with X = sin x. 

sin x 



Exercise 8.C.2 

These are the answers you should have for dyldx in each case. 

2 

(1) 5e 5x (2) -2e- 2x (3) 2xe x (4) 
-1 1 



(5) 



1 



(6) = 

1 - x x — 1 

(10) -3 sin (3x + 4). 



2x + 3 1 + x 

(7) 7 cos 7x (8) -4 sin x cos 3 x (9) 2 cos (2x + jt) 
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Exercise 8.C.3 

I have used the square brackets again to make the working clear for you. 

(1) dyldx = [5 cos 4 2x] [-2 sin 2x] — -10 sin 2x cos 4 2x. 

(2) dyldx = [3 sin 2 {Ax + 1)] [4 cos (Ax + 1)] = 12 cos (Ax + 1) sin 2 (Ax + 1). 



(3) 



(5) 



(6) 



(7) 



dx 
dt 



1 



sin (2t + 3) 



[2 cos (2t + 3)] = 2 cot (2? + 3). 



(4) dxIdO = [3 (2 cos 20 + 5) 2 ] [-4 sin 26] = -12 sin 20 (2 cos 20 + 5) 2 



dy 
dx 

dx 
dt 

dx 
dt 



1 



1 + cos X 

1 



3t + sin 2 3t 
1 



2 + e 



- sin x 

[- sin x] = . 

1 + cos x 

[3 + 2 sin 3t (3 cos 3/)] 

2te' 2+l 



3 + 6 sin 3t cos 3t 
3t + sin 2 3* 



[2te< +1 ] = 



2 + e' +1 

(8) cfy/dx = [cos (cos Ax)] [-4 sin Ax] = -4 sin Ax cos (cos 4x). 

(9) dyldt = [\ (1 + sin 2 0~ 1/2 ] [2 sin / cos f] = sin ? cos t (1 + sin 2 ty 112 . 
(10) Using the third law of logs to make this question easier, we have 



y = J In (1 + sin 2 t) so — 

dt 



1 



1 + sin 2 t 



[2 sin / cos t] = 



sin £ cos 7 
1 + sin 2 t 



Exercise 8.C.4 

Again, I have sometimes used square brackets to show the way in which the answers build up 
more clearly. 

(1) u = 7x 2 and v = cos 3x so 

dy 



dx 



(cos 3x) (14x) + (7x ) (-3 sin 3x) = 14x cos 3x - 21x sin 3x. 



(2) u = e 3x and v = sin 2x so 
dy 



dx 



= (sin 2x) (3e ix ) + e ix (2 cos 2x) = e ix (3 sin 2x + 2 cos 2x). 



(3) u = Ax 5 and v = (x 2 + 3) 3 so 
dy 



dx 



[0 + 3) j ] [20x 4 ] + [4x 5 ] [3(x 2 + 3) 2 (2x)] 



4x 4 (x 2 + 3) 2 [5(x 2 + 3) + 6x 2 ] = 4x 4 (x 2 + 3) 2 (1 lx 2 + 15). 



(4) u = e 2t+ 1 and v = cos (2t + 1) so 

dx 

— = [cos (2t+ 1)] [2e 2 ' +1 ] + [e 2t+1 ] [-2 sin (2t + 1)] 
dt 

= 2e 2,+ 1 (cos (2t + 1) - sin (2t + 1)). 

(5) u = It 2 and v = In (2t - 1) so 

dx 



dt 



[In (2t- \)][\At] + [It 2 ] 



2t- 1 



= 14? In (2t - 1) + 



It - 1 
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(6) u = {t 2 + \) m and v = sin (2t + n) so 

dx , 

■ = [sin (2t + 7t)\ [5 (? 2 + 1)- 1/2 (20] + [(? 2 + 1) 1/2 ] [2 cos (2t + «)] 



/ sin (2f + tt) 



+ 2(7 2 + 1) 1/2 cos (2? + n). 



(t 2 + 1) 1/2 

(7) y = [(x 2 + l) 5 ] [e 3x cos 2x] so u = (x 2 + l) 5 and v = e 3x cos 2x. 

First, using the Product Rule to find dv/dx, we have 
dv 

— = [cos 2x] [3e ix ] + [e ] [-2 sin 2x] = e ix (3 cos 2x - 2 sin 2x). 
dx 

Now, using the Product Rule again to find dyldx, we have 
dy 

— = [e 3x cos 2x] [5(x 2 + l) 4 (2x)] + [(x 2 + l) 5 ] [e 3x (3 cos 2x - 2 sin 2x)] 
dx 

= e 3x (x 2 + l) 4 [lOx cos 2x + (x 2 + 1) (3 cos 2x - 2 sin 2x)]. 

(8) x = (2 + t)e 3t so dxldt = (e 3 ')(l) + (2 + t)(3e 3t ) = (7 + 3t)e 3t . 

Using the Product Rule again, we get 

d 2 x 

— = (7 + 3?) (3e 3i ) + (e 3t ) (3) = (24 + 9t)e 3r 
dt 2 

d 2 x 

so — - 6 dxldt + 9x = (24 + 9t)e 3 ' - 6(7 + 30e 3 ' + 9(2 + t)e 3 ' = (0 + 00e 3 ' = 0. 

(9) x = e fe so dxldt = k e k < and d 2 xldt 2 = k 2 e kt . 

d^x dx 

— - 2— -3x = 0 so k 2 e kt - 2ke k < - 3e kt = 0. 
dt 2 dt 

Therefore e k '(k 2 - 2k - 3) = 0 but e fe is never equal to zero. 
So we have 

k 2 - 2k -3 = 0 so (k-3) (k+ 1) = 0 so £=3 or £ = -1. 

(10) We know from the last question that both e 3t and e~* fit this equation. Multiplying them by 
constant numbers will make no difference to this fit. 

Therefore x = Ae 3t + Be~' is a solution to this equation. 

(In fact, it is the solution to this equation which covers all possible situations.) 

(11) x = e-< In (1 + e'). 

Using the Product Rule gives 

dx ( e' \ 1 

— = In (1 + e'X-e-') + e~< = -e* In (1 + e>) + 

dt \ 1 + e'j \+e* 

because (e') (e~ l ) = e° = 1. 

dx 11 

Therefore — + x = -e~* In (1 + e') + + e~* ln(l + e 1 ) = . 

' 1 -i- a > 



dt 1 + e' 1 + e 



Exercise 8.C.5 

dy (sin x) (- sin x) - (cos x) (cos x) sin 2 x + cos 2 x 

(1) — = = = - cosec x. 

dx sin x sin x 

dy sin x 

(2) If y = (cos x) _1 then — = -(cos x)~ 2 (-sin x) = — = sec x tan x. 

dx cos x 
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1 (sin x)(0) - (l)(cos x) cos x 

(3) If y = then dyldx = = - — - — = - cosec x cot x. 

sin x sin 2 x sin 2 x 

These last two questions show how you can use either the Product Rule or the Quotient 
Rule to answer them. 

sin t dx (1 + cos t) cos t - sin t (-sin t) 1 

(4) Ifx = then — = = . 

1 + cos t dt (1 + cos t) 1 + cos t 

(5) If y = In (sec x + tan x) then we know from Example (1) in Section 8.C.(e) that 

dldx (tan x) = sec 2 x and from question (2) above that dldx (sec x) = sec x tan x. Therefore 

d sec x tan x + sec 2 x 

— (In (sec x + tan x)) = = sec x 

dx sec x + tan x 

taking out sec x as a factor on the top of this fraction and then cancelling. 

e x — e~ x 
e + e 

dy _ (e x + e- x )(e x + e~ x ) - (e x - e - x )(e x - O 
dx (e x + e- x f 

(e 2x + 2 + e- 2x ) - (e 2x -2 + e~ 2x ) 4 



(e x + e- x f (e x + e^) 2 

(remembering that e x X e~ x = e x ~ x = e° = 1 and dldx{e~ x ) = -e~ x ). 
l\ +x\ 

(7) y = In = In (1 + x) - In (1 - x) using the second law of logs, so 

\l-xj 

dy 1 / -1 \ (1 - x) + (1 + x) 



dx l+x \l-xj (l+x)(l-x) 1-x 2 

(8) Here, we need to use the Product Rule to differentiate the V of x 2 sin x on the top. Doing 
this first gives 

dy . 

— = (sin x) (2x) + x (cos x). 

dx 



So 



dy (3 - x) (2x sin x + x cos x) - (x sin x) (-1) 
dx (3 - x) 2 

x(3x cos x - x 2 cos x - x sin x + 6 sin x) 



,2 



(3-x) 

3x - 2 dy (2x + 3)(3) - (3x - 2)(2) 13 

(9) y = so 



2x + 3 dx (2x + 3) 2 (2x + 3) 2 

(Notice that this time we have found that the gradient of the curve is always positive, 
except when x = -j, when the curve itself is undefined.) 

ax + b dy (cx + d)(a) - (ax + b)(c) ad - be 

(10) y = so — = = -. 

cx + d dx (cx + d) (cx + d) 

The bottom of this fraction is always positive, and ad - be is a constant number itself since 
it is made up of constant numbers. 

Therefore, dyldx will remain always either positive or negative depending on the 
particular values of a, b, c, and d, unless x = -die, when the function y itself is undefined. 

The only exception to this is the special case when ad - be = 0 when 

dy ax + b 

— = 0 everywhere, so y = must be totally flat. 

dx cx + d 
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How could this be? 

If we put a = 6, d = 1, b = 3 and c = 2, we would have an example of this. We then get 

6x + 3 3(2x + 1) 
y = = = 3. 

2x + 1 2x + 1 



It is totally flat! 



See if you can show for yourself that, if ad - be = 0, then 

ax + b 

y = : 

cx + a 

must simplify down to the form y — a constant, so giving a horizontal line with a 
gradient of zero. 

Hint: if ad - be = 0 then = fee. Put ad = be = k and substitute for d and c in 
the equation for y. Then tidy this up. 



Exercise 8.D.1 

(1) If e x = 2 then e -x = \le x = \. 

Using the definitions for sinh x, cosh x and tanh x gives us 

sinh x = \ (2 - \) = \, 
cosh x = \ (2 + ^) = f, 
tanh x = (|) (|) = f. 

(2) e° = 1 so sinh x = 0 and cosh x = 1. This gives tanh x = 0/1 = 0. 

(3) (a) 2 sinh 2x (b) 3 cosh (3x + 5) 

(c) Using the Product Rule, we get 

2e Zx sinh 3x + 3e 2x cosh 3x = e 2x (2 sinh 3x + 3 cosh 3x). 
sinh x 

(d) 5 sech 2 5x (e) = tanh x (f) 2 cosh 3x (3 sinh 3x) = 6 sinh 3x cosh 3x 

cosh x 

Exercise 8.D.2 

(1) dyldx = cosh x and cosh(0) = 1, so the gradient of the curve y = sinh x at the origin is 1. 

(2) The line y = x is the tangent to the curve y = sinh x when it passes through the origin, since this 
line also passes through the origin and has a gradient of 1 . 

(3) Since y = sinh x and y = sinh -1 x are symmetrically placed either side of y — x, this line is also 
the tangent to y = sinh -1 x at the origin, and the gradient of sinh -1 x here must also be 1 . 

Exercise 8.D.3 

( x\ dy 3 

(1) (a) If y = tan — then a = 3 so — = 

\3/ dx x 2 + 9 

( x\ dy I 

(b) If y = sinh — then a = 5 so 



,5/ ' ' <fe Vx 2 + 25 

3x\ fi?y | 

(c) If y = tan -1 I — then a = 2/3 and 



2 j dx x 1 + 4/9 9x 2 + 4 

,'2x\ dy 1 2 

(d) If v = sinh -1 — then a = 3/2 and — = = , . 

dx Vx 2 + 9/4 \4v • 9 
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dy 1 
(2) (a) \fy = tan" 1 (5x) then — = 5 X 



dx (5x) 2 + 1 25x 2 + 1 

dy 1 3 

(b) If y = sinlr 1 (3x) then — = 3 X 



(c) If y = tan" 1 (x + 3) then 

(d) If y = tan" 1 (3x + 4) then 

(e) If y = tan" 1 (1 - x) then 

(f) If y = sinlr 1 (2x + 1) then 

(g) If y = sinlr 1 (3 - 2x) then 



dx ' V(3x) 2 + 1 x 9.v- + 1 
dy 1 1 



dx (x + 3) 2 + 1 x 2 + 6x + 10 
dy 3 3 



dx (3x + 4) 2 + 1 9x 2 + 24x + 17 

dy -1 -1 

dx (1 - x) 2 + 1 x 2 - 2x + 2 ' 

dy 2 2 



dx V(2x + l) 2 + 1 V4x 2 + 4x + 2 ' 
dy -2 -2 



dx v'(3 - 2x) 2 + 1 V4x 2 - 12jc + 10 ' 



I'll include more details for the last three questions, by putting X= 'lump', as these are 
slightly more complicated. 

(x + 3\ x + 3 1 

(h) y = tan so X = and dXIdx = — , so 

\ 4 / 4 4 

dy I 1 16 16 



so 



dx X 2 +l (^Hr) 2 +1 (x + 3) 2 + 16 x 2 + 6x + 25 

dy dy dX 4 

dx dX dx x 2 + 6x + 25 



2x + l\ 2x + 1 dX 2 

(i) v = tan so X= and — = — . 

' 3 / 3 dx 3 

dy I 9 9 



C^) 2 + 1 (2x + l) 2 + 9 4x 2 + 4x + 10 

dy 2 9 3 

so — = - X 



dx 3 4x 2 + 4x + 10 2x 2 + 2x + 5 

/3x + 2\ 3x + 2 dX 3 

(j) y = tan so X= and — = — . 

\ 4 / 4 dx 4 

cfy 1 16 16 



so 



dX (^f 1 ) 2 + 1 (3x + 2) 2 + 16 9x 2 + 12x + 20 
dy dy dX 12 



dx dX dx 9x 2 + 12x + 20 
(3) We have y = tanlr 1 x = \ In 



1 + x 
1 -x 
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The first thing to do here is to use the second rule of logs to write y more simply. (This 
saves a huge amount of work.) This gives us 

y = \ In (1 + x) - \ In (1 - x) 

dy , / 1 J\-x+\+x\ 1 

dx 2 \\+x l-x) 2 \ l-x 2 ) l-x 2 ' 

(4) We can solve 8 sinh x = 3 sech x in two ways. 

method (l) Do it in the same way you would solve a trig equation. 

8 sinh x = 3/cosh x so 8 sinh x cosh x — 3 so 4 sinh 2x — 3 

using the rule of Section 8.D.(d) that sinh 2x = 2 sinh x cosh x. 

We now have sinh 2x — 4 and sinh -1 (4) = 0.6931 sox = 0.347 to 3 d.p. 

Notice that you don't have the infinite numbers of solutions that you get 
with trig equations to worry about here! This is one way in which sinh and 
cosh don 't behave like sin and cos. 

I show a sketch of what's happening graphically here below. 




method (2) Write everything in terms of e x . This gives 

so 4(e x - e~ x ){e x + e~ x ) = 6 



y e x + e 

so 2{e 2x - e- 2x ) = 3 so 2e 4x - 3e 2x -2 = 0 

3 ± + 16 3 ± 5 

so e 2x = = = 2 

4 4 

since e 2x is always positive. This gives 2x = In 2 so x = 0.347 as before. 

(5) (a) We solve 2 sinh 2 x - 5 cosh x - 1 = 0 in a very similar way to the trig equations in 
Section 5.E.(b). 

sinh 2 x = cosh 2 x - 1 so 2(cosh 2 x - 1) - 5 cosh x - 1 = 0 

so 2 cosh 2 x - 5 cosh x - 3 = 0 so (2 cosh x + l)(cosh x - 3) = 0 

so cosh x = -\ (which is impossible as you can see from the graph in Figure 8.D.5) or 
cosh x = 3. This gives x = cosh -1 (3) = 1.763 to 3 d.p. 

(x could also be -1.763 from the symmetry of the cosh graph. Excellent, if you also 
got this answer!) 

(b) 3 sech 2 x + 8 tanh x - 1 = 0 so, using 1 - tanh 2 x = sech 2 x, we have 

3(1- tanh 2 x) + 8 tanh x-7 = 0 so 0 = (3 tanh x - 2)(tanh x - 2) 

so tanh x = \ giving x = 0.805 to 3 d.p. or tanh x = 2 which is impossible since the 
values of tanh x lie between -1 and +1 as you can see in Figure 8.D.7. 
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Exercise 8.E.1 



(1) y = e x so dyldx = e x , 

(a) x = 0 so dyldx = e° = 1 so the gradient of the tangent at (0, 1) is 1. 

The equation of this tangent is y - 1 = l(x - 0) so y = x + 1. 

(b) x = 1 so dyldx = e l = e. 

The equation of the tangent at (1, e) is y - e = e(x - 1) so y = ex. 

Notice that this particular tangent passes through the origin, and has a gradient of e. 
The lines y — x and y = 3x also pass through the origin. Since 1 < e, y — x doesn't cut 
the curve of y = e x at all - there are no solutions to the equation x = e x . However, the 
line y = 3x cuts the curve y = e x twice, because 3 > e. This means that the equation 
3x = e x has two solutions. 

(c) x = 2 so dyldx = e 2 . 

The equation of the tangent at (2, e 2 ) is y - e 2 = e 2 (x - 2) so y = e 2 x - e 2 , so it 
cuts the y-axis at about -7.4. 

(2) y = tan x so dyldx = sec 2 x. (I showed this result in Section 8.C.(e).) 

(a) x = 0 so dyldx = 1. 

When x = 0, j = 0 also, so the equation of the tangent at (0,0) is y = x. 

(b) If x = n/4, then y = 1 and 

4V 11 

— = sec 2 (jt/4) = = — ■ — = 2. 

dx cos 2 (jr/4) (4p 2 

The equation of the tangent at (;t/4, 1) is y - 1 = 2(x - ji/4) so j = 2x + (1 - ji/2). 
This tangent cuts the j-axis at about -0.57. 

(3) Tangent (a) in (2) actually crosses the curve at the origin. It is behaving just like the 
tangents at the origin to y = sinh x and y = tanh x which we found in the last section. Each 
of these tangents happens to be the line y = x, but other tangents can also cross their curves. 
An example of this is tangent (c) to y = cos x at x = nil, just before this exercise. 



Table 8.E.2 



Function 


dx 


<afy 

Value of — 

dx 


dy 2 
'dx 2 


dy 2 

Is — +, - or 0? 

dx 2 


(a) y = x 2 at O 


2x 


0 


2 


+ 


(b) y = x 3 at O 


3x 2 


0 


6x 


0 


(c) y = -x 2 at O 


-2x 


0 


-2 




(d) y = tan x at O 


sec 2 x 


1 


2 sec x (sec x tan x) 
= 2 sec 2 x tan x 
(Chain Rule) 


0 


(e) y = cos x 

(i) at A 

(ii) at B 

(iii) at C 

(iv) atZ) 


- sin x 


(i) 0 

(ii) -1 

(iii) 0 

(iv) 1 


- cos x 


(i) 

(ii) 0 

(iii) + 

(iv) 0 


(f) >> = x 4 at O 


4x 3 


0 


12x 2 


0 
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Exercise 8.E.2 

x — 1 X — 1 

x 2 - 4 (x - 2)(x + 2) 

\ so j cuts the y-axis at (0, j). 

(2) fix) = 0 when x = 1 so j cuts the x-axis at (1, 0). 

(3) We cannot divide by zero so we must exclude the values x = -2 and x = +2. 

The straight lines x = —2 and x = +2 are vertical asymptotes fory = fix). 
The value of/(*) is large and negative if x is just less than -2. 
The value ofy(*) is large and positive if x is just greater than -2. 
The value of fix) is large and negative if x is just less than +2. 
The value of fix) is large and positive if x is just greater than +2. 

x-1 (l/x)-(l/x 2 ) 

(4) fix) = = dividing top and bottom by x 2 . 

x 2 - 4 1 - (4/x 2 ) 

1/x, 1/x 2 and 4/x 2 all tend to zero when x becomes large, so fix) — > 0/1 = 0 for both 
large positive and negative values of x. This means that y = 0 is a horizontal asymptote. 

dy (x 2 - 4)(1) - (x - l)(2x) 2x - 4 - x 2 

(5) dx (x 2 -4) 2 (x 2 -4) 2 
so dyldx = 0 if x 2 - 2x + 4 = 0. 

There is no value of x on the x-axis which fits this equation since 'b 2 - 4ac' = -12. 
In other words, this equation has no real roots. 

Therefore, there are no turning points. This curve never has a horizontal tangent. 
I show a sketch of it at the end of the answers to this exercise. 



(a) y=fix) = 
(1) AO) - 



(b) y = g{x) = - 

1 + x 2 

(1) = 0 so y = g(x) cuts the x-axis at (0, 0). This answers (2) as well. 

(3) 1 + x 2 is always positive so no value of x has to be excluded. 

x 1/x 

(4) g(x) = = dividing top and bottom by x 2 . 

1 + x 2 (1/x 2 ) +1 

1/x and 1/x 2 both tend to zero when x becomes large, so again y = 0 is a horizontal 
asymptote. 

dy (1 + x 2 )(l) - (x)(2x) 1-x 2 

(5) — = — = — = 0 if 1 - x z = 0 

dx (1 +x 2 ) 2 (1 +x 2 ) 2 

so dyldx = 0 if x = -1 or x = +1 since 1 - x 2 = (1 - x)(l + x). 

Passing through x = — 1, the sequence of values of dyldx goes - 0 + so there is a 
local maximum of g(-l) = -5 at the point (-1, -j). 

Passing through x = +1, the sequence of values of dyldx goes + 0 - so there is a 
local maximum of g(+l) = 5 at the point (+1, 5). 

I show a sketch of this curve at the end of the answers to this exercise. 

It is another example of an odd function. If you turn the page upside down, the curve 
looks unchanged. It has its one moment of glory near the origin, and then sinks back to 
the x-axis. 
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4 

y = h(x) = x + — 
x 

(1) h(0) is undefined. There is a vertical asymptote of x = 0. 

(2) h(x) = 0 if x + 4/x = 0 or x 2 = -4. 

There are no values of x on the x-axis which fit this equation, so h(x) never cuts the 
x-axis. 

(3) We have already decided in (1) that we must exclude x = 0. 

(4) As x — > oo we have 4/x — > 0 so h(x) gets closer and closer to the line y = x. 

The same thing is true if x — > - °°, so _y = x is an asymptote ofy = h(x). 

(5) cfy/Jx = h'(x) = 1 - 4/x 2 so h'(x) = 0 if x = -2 or x = +2. 

d 2 y/dx 2 = 8/x 3 so if x = -2, d 2 yldx 2 is negative. 

Therefore, there is a local maximum of A(-2) = -4 at the point (-2, -4). 
If jc = +2, d 2 y/dx 2 is positive so there is a local minimum of h(2) = 4 at the point 
(2,4). 

Notice that the local minimum value is larger than the local maximum value! This is 
possible because the function h(x) has a discontinuity at x = 0. The curve does a jump 
there. 

I show a sketch of this curve at the end of the answers to this exercise. 
9 

y=p(x)=x-- 
x 

This looks quite like (c) but the change in sign makes an enormous difference to the 
behaviour of the function and the shape of its curve. 

(1) p(0) is undefined, There is a vertical asymptote of x = 0. 

(2) p(x) = 0 if x - (9/x) = 0 so x = 9/x so x 2 - 9 = 0 so x = -3 
or x = +3. 

The curve cuts the x-axis at (-3, 0) and (+3, 0). 

(3) We have already decided that we must exclude x = 0. 

(4) As x — > oo ; p(x) — > x just like example (c) above. 

(5) dyldx = p'(x) = 1 + 9/x 2 = 0 if 9/x 2 = -1 or x 2 = -9. 

There are no values of x on the x-axis which fit this equation therefore this curve has 
no turning points. 

I show a sketch of it at the end of the answers to this exercise. 

Both (c) and (d) are also odd functions. Try turning the page upside down. 

y =fix) = x 2 e x 

(1) J{0) = 0 so the curve passes through the origin. In fact, it has the repeated root x = 0 
there, so it just touches the x-axis. 

(2) has been answered by (1). There are no other solutions of fix) = 0 since e x is never 
equal to zero. 

(3) There are no values of x which must be excluded. 

(4) x 2 e x becomes very large, as x becomes very large and positive. 

When x becomes large and negative, e x gets smaller faster than x 2 gets larger, so 
x 2 e x becomes very small. 



Answers to the exercises 



(5) dyldx = e x (2x + x 2 ) = 0 if x = 0 or x = -2. 

rfV^ 2 = O + x2 ) eX + e* ( 2 + 2 *) = e*(x 2 + 4x + 2). 

If x = 0, d 2 y/dx 2 is positive, so y = 0 is a local minimum. 

If x = - 2, d z yldx z is negative, so >> = 4/e 2 is a local maximum. 

This gives the sketch which I show below. 
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Exercise 8.E.3 

(1) I show a drawing for this question in diagram (1) below. 



© 



'{t;;..'.- ;:: 
















i.i,".' -. 



4> 




The volume of the box is given by V = (10 - 2x)(10 - 2x)x = lOOx - 40x 2 + 4x 3 . 
JF/dx = 100 - 80x + 12x 2 so dVldx = 0 if 3x 2 - 20x + 25 = 0 

or (3x - 5)(x - 5) = 0. The solutions to this equation are x = 5 or x = f . 

Clearly, x = 5 gives the minimum value. After removing these squares there would be no 
cardboard left! 

Checking for absolute certain that x 
d 2 V/dx 2 = 6x - 20 which is negative if x = f. 



3 does give a maximum value for V, we have 



So the maximum volume of the box is (y)(y)(f) = 2000/27 = 74.1 cm 3 to 1 d.p. 

(2) I show a cross-section along the axis of the cylinder in diagram (2) above. 
If V is the volume of the cylinder, then V = xcr 2 h. 

We now need to use the physical relationship of the cylinder to the sphere to find r in 
terms of h and R. 

We can do this by using Pythagoras' Theorem in triangle AOS. This gives us 



R 2 = r 2 + 



so 



r 2 = R 2 



h 2 
+ — 

,2/4 4 
It is easier this time to substitute for r than h, so we have 

h 2 \ . h 3 



V = Jtr 2 h =n\R 



h = JiR 2 h -71- 
4 / 4 



Differentiating V with respect to h we get 

dV 



dV h 2 
— = jiR - 3tt — so 
dh 4 dh 



= 0 if 3h z = 4R 2 



or 



h = 



2R 



Differentiating again, we get d 2 V/dh 2 = - 6jth/4 which is negative since h is positive, so 
this value of h does give us the maximum volume of the cylinder. 

Also, R 2 = r 2 + (h/2f so if h = 2R/43 we have R 2 = r 2 + R 2 /3. This gives 

r=^R. 

'2tf 2 ^ 
3 



The volume of the cylinder = nr 2 h = Jt 



4ttR- 



3V3 



The volume of the sphere is 3 jc R 3 so the largest cylinder fills l/\' 3 of the hollow inside 
the sphere. l/\'3 is 0.577 to 3 d.p. so it does fill more than half of it. 

(3) In Section 5.D.(f) we showed that 4 sin t + 3 cos t= 5 sin(? + 0.6435). 

The maximum value for a sin is one, so the maximum distance of the particle from the 
origin is 5 units. 
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Exercise 8.E.4 



(1) We want to solve 2x 3 - 3x 2 + 6x + 1 = 0, so we need to have some idea of the shape of 
fix) = 2x 3 - 3x 2 + 6x + 1. We can see that /(0) = 1 and f'{x) = 6x 2 - 6x + 6. Try using this 
to look for turning points if you got stuck finding the roots of this equation. 



fix) is never equal to zero because its 'b 2 - 4ac' is negative. Also, since the coefficient 
of x 2 is positive, fix) is always positive, meaning that / (x) always has a positive gradient. 
It must look something like graph (1) below, and it only has one root. 

/(-I) = -10, so this single root lies between x = 0 and x = -1 since fix) is continuous, 
and it is closer to x = 0. 

Using the Newton-Raphson rule starting with x x = 0 gives us 

x 2 = 0 - 6 = - 6- 



1 



-0.0741 



x 3 = -z - = -0.156 to 3 d.p. 

1 7.1667 ' 

Repeating the process gives x = -0.156 to 3 d.p. again, so this is the required solution. 

(2) The sketch of e x — 3 - x, in graph (2) below, shows the single solution. 

We have to solve e x - 3 + x = 0, so we put fix) = e x - 3 + x, giving f'(x) = e x + 1. 
f(l) = e - 2 = 0.718 and /(0.5) = -0.851 so the solution lies between these two values 
since fix) is continuous. 

/0.718\ 

Starting with x, = 1 gives x 2 = 1 - = 0.807 to 3 d.p. 

\3.718/ 

/ 0.0482 \ 

x 3 = 0.807 - = 0.792 to 3 d.p. 

\3.2412/ 

Repeating the process confirms this answer to 3 d.p. 

(3) We can see from sketch (3) below that (i) has just the single solution of x = 0 while (ii) has 
three solutions symmetrically placed, (y = x is a tangent here, just like it was to y = tanh x.) 

Let fix) = sinh x - 2x so fix) = cosh x - 2. 

Using a calculator, /(l) = -0.825, /(2) = -0.373 and/(3) = 4.018 so there is a solution 
between x = 2 and x = 3 since fix) is continuous. 

/- 0.373 \ 

Starting with Xi = 2 gives x 2 = 2 - = 2.212 to 3 d.p. 

\ 1.762 / 

Repeating the process three more times confirms the answer as x = 2.177 to 3 d.p. 
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Exercise 8.F.2 

In these answers, I shall use the shorthand 'w.r.t.' to mean 'with respect to'. (This is quite 
commonly done in maths.) 

(1) Differentiating x 2 y 2 = 25 implicitly w.r.t. x gives 

/ dy\ dy 2xy 2 y 

2xy 2 + x 2 \ 2y — = 0 so — = - — — = - -. 

\ dx) dx 2x y x 

The gradient of x 2 y 2 = 25 at (1, 5) is -5. 

The equation of the tangent here is y - 5 — -5(x - 1) or y + 5x — 10. 

(2) Differentiating x 2 + 3xy - y 2 — 2 implicitly w.r.t x gives 

dy dy 
2x + 3y + 3x — -2y — = 0 
dx dx 

dy -2x - 3y 2x + 3y 

so — = = . 

dx 3x — 2y 2y — 3x 

g 

The gradient of the curve at (1, 2) is j = 8. 

The equation of the tangent here is y - 2 — 8(x - 1) or y — 8x - 6. 

(3) Differentiating My + 2/x = 1 implicitly w.r.t. x gives 

1 dy 2 dy -2/x 2 

----- = 0 so — = = -2y 2 lx 2 . 

y dx x dx My 

8 1 

The gradient of the curve at (4, 2) is - jg = -j. 

The equation of the tangent here is y — 2 = -5 (x - 4) or 2y + x = 8. 
Alternatively, multiplying the original equation by xy gives x + 2y = xy. 
Differentiating this implicitly w.r.t. x gives 

dy dy dy y - 1 

1+2 — =y + x — so — = . 

dx dx dx 2 — x 



This looks alarmingly different from the first answer, but substituting x = 4 and y = 2 gives 
dyldx = as before, and therefore, of course, the equation of the tangent is the same. 
Because x and y are related by fitting the equation of the given curve, dyldx will give the 
same answer for any particular point, using either form. 

(4) (a) Putting X = 2x - 5 and y = sin -1 X, we have 

dy I 2 1 

X 2 = 



dx fl -(2x-5) 2 x 2().v - 4x 2 - 24 V5x - x 2 

We must have -1 < 2x - 5 < 1 so 4 < 2x < 6 or 2 < x < 3. 

(b) Putting X = 3x + 1 and y = cosh -1 X, we have 

dy 1 3 

X 3 



dx V(3x + l) 2 - 1 ' V9x 2 + 6x' 

We must have 3x + 1 > 1 so x > 0. 

(5) y = tanh -1 x so tanh y = x and differentiating implicitly w.r.t. x gives 

dy dy 1 1 1 

1 = sech y — so 



dx dx sech 2 y 1 - tanh 2 y 1 - x 2 

(This working uses results from Section 8.D.(c).) 

The answer we have found agrees with the answer to question (3) of Exercise 8.D.3 
where we used 



tanh 1 x = 1 In 



1 +x 



1 -x, 

From Figure 8.D.8 we see that -1 < x < 1, and y = tanh 1 x has a positive gradient. 
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(6) (a) lfy = x x then In y = In (x x ) = x In x so My dyldx = In x + x (1/x) = In x + 1 
Therefore cfy/Jx = y(\n x + 1) = (In x + I) x x . 

(b) If y = 3x (x + 2) then In y = In (3x (x + 2) ) = In 3 + In (x (x + 2) ) = In 3 + (x + 2) In x. 

(The splitting up is a bit tricky here! It uses both the first and third rules of logs.) 
Differentiating implicitly w.r.t. x gives 

1 dy (\\ 2 
= 0 + lnx + (x + 2)-=lnx+l+- 

y dx \x J x 

so — = I lnx + 1 + -1 3x (x + 2) . 



dx 

(c) y = (3x/ x + 2) so In y = In ((3x/ x + 2 >) = (x + 2) In 3x. 

(This is what you also get for (b) if you aren't careful!) 
Differentiating implicitly w.r.t. x gives 

1 dy / 3 \ 2 

= In 3x + (x + 2) — = In 3x + 1 + - 

y dx \3x/ x 

dy ( 2\ ^ 

so — = In 3x + 1 + - (3x) (x + 2) . 
dx \ x 



Exercise 8.G.1 

2t At 2 8? 3 (2tJ 

(1) Putting 'f = It, we get e 2 ' = 1 + — + — + + . . . + + . . . 

1! 2! 3! r! 

t t 2 t 3 f 
Putting 'f = -f, we get e~' = 1 - — + — - — +... + (-l) r — + . . . 

1! 2! 3! r! 

(2) (a) Putting 'f = 2f, we get 

2f 8f 3 32? 5 (2?) 2r+1 » (2/) 2r+1 

sin 2t = + ... +(-l) r + . . . = 2 (-If 

1! 3! 5! (2r+l)! '=o (2r + l)\ 

t t 3 t 5 f 

(b) Putting 'f = -t, we have sin (-t) = - — + — - — + — -...=- sin t 

1! 3! 5! 7! 

which is what we expect from the shape of its graph in Section 5.A.(a). 

(3) f{t) = cos t,f'(t) = -sin t,f(t) = - cos t and f"(t) = sin t. 

Differentiating again brings us back to where we started. 

We have/(0) = l,f'(0) = 0,/"(0) = - 1 and/"'(0) = 0, giving us the series 

t 2 t 4 t 6 t 2r - t 2r 

cos?=i- — + — - — + ... + (-iy + . . . = x (-iy 



2! 4! 6! (2r)\ '=<> (2r)! 

starting the count with r = 0. It's a good idea to check that you've written down the 

rth term correctly by putting r = 0, 1 and 2 and checking that you do actually get the first 

three terms of the series. Then you can do any necessary tweakings to make it right. 

(a) Putting 'f =2t in the series above, we get 

At 2 16/ 4 64t 6 (2t) 2r 

cos 2t = 1 + + . . . + (-1Y + . . . 

2! 4! 6! (2r)\ 
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t 2 t 4 t 6 

(b) Putting 'f = -t gives us cos (-t) = 1 - — + — - — + ... = cos t. 

2! 4! 6! 

as we expect it to be, from the shape of the graph in Section 5.A.(a). 

(c) Similarly, putting '? = nt gives us 

n 2 t 2 n 4 t 4 n 6 t 6 (nt) 2r 

cos nt=l- + + . . . + (-1Y + . . . 

2! 4! 6! (2r)\ 

(4) fit) = sinh t so f'(t) = cosh t and/"( f ) = sinh / and so on. 

From this, we have f(0) = 0,/'(0) = 1 and /"(0) = 0, and so on. This gives 



f ? 3 ? 5 
sinh t = — + — + — + 
1! 3! 5! 



t 2r+ 1 



(2r + 1)! 



+ ...= X 



t 2r + 1 



'=o (2r + 1)! 



f f 2 t 3 t 4 



Adding the series gives sinh t + cosh t = 1 + — + — + — + — + 

1! 2! 3! 4! 



Chapter 9 

Exercise 9.A.1 

(1) Iff(x) = e x then y =f(x) = e x + C. Putting j = 2 and x = 0 gives 2 = e° + C= l + Cso 
C = 1, giving us j = e x + 1. 

(2) If g'(*) = sinh x thenj = g(x) = cosh x + C. (See Sections 8.D.(a) and (b) if necessary.) 
Putting y = x = 0 gives 0 = cosh (0) + C = 1 + C so C = -1 giving y = cosh x - 1. 

(3) If h'(x) = 3x 2 then y = h(x) = x 3 + C. Putting j = 2 and x = 1 gives 2 = 1 + C so C = 1 
giving y = x 3 + 1. 

The three curves look like my drawings below. 



L) =Coshjc-) 




CD 



-z 



|-X 3 4-| 



Exercise 9.A.2 

(1) lfx = 5t 3 + 3t 2 then (a) 1/ = dxldt = I5t 2 + 6t and (b) a = d 2 xldt 2 = 30? + 6. 

(2) If v = 3t 2 + It then 

(a) x = t 3 + t 2 + 5 since x = 5 when t = 0. 

(b) a = dvldt = 6t+2. 

(3) If a = d 2 xldt 2 = 2t+\ then 

(a) v = dxldt = t 2 + t + 5 since v = 5 when / = 0. 

(b) x = j/ 3 + j? 2 + 5/ since x = 0 when t = 0. 

(4) If 1/ = 4e 2 ' then 

(a) x = 2e 2 ' - 1 since x = 1 when f = 0 and e° = 1. 

(b) a = dvldt = 8e 2 '. 
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(5) If v = dxldt = - sin t then 

(a) x = cos t since x = 1 when t = 0. 

(b) a = dvldt = d 2 xldt 2 = - cos t so we have d 2 x/dt 2 = - x. 

The acceleration is towards the origin and equal to the distance from the origin so this is 
an example of SHM. (For more information on SHM see Sections 5.C.(b) and 8.A.(e).) 

(6) v = dxldt = cos (t + jr/6) so 

(a) x = sin (t + ji/6) since x = \ when t = 0. 

(b) a = dvldt = d 2 xldt 2 = - sin (t + Jt/6) so, again, d 2 xldt 2 = - x giving another example of 
SHM. 

Exercise 9.A.3 

(1) j' (2t 2 + 3t+ \)dt = [| / 3 + \t 2 + t] 3 i = (|(27) + |(9) + 3) - (| + | + 1) = 3l|. 

(2) (2t 3 -t)dt=[\t 4 -\ t 2 ^ = (128 - 8) - (0) = 120 

rji/6 r I jt/6 / — 

(3) sin x dx = - cos x\ = - cos (jt/6) + cos (0) = - \ 3/2 + 1. 
Jo L Jo 

The values of the sin, cos and tan of certain particular angles are often useful in 
integration. You will find these special values in Section 4.A.(g). Also, if you have trouble 
remembering which way the signs go when you differentiate and integrate sin and cos, 
remember the rule 'Solve Damn Problem or Sin Differentiates Plus'. The other three are 
then easy to get. 

The sketches for questions (3), (4), (5) and (6) are shown below. 




(4) 2 sinx dx= \-2 cos xV = 2(- \/3/2 + 1) = 2 - V3. 
Jo L Jo 

The effect of the 2 is just to double the integral so doubling the area, as you can see 
above. It can be taken outside as a factor, giving 2 Jq' 6 sin x dx if we want. 

rjr/6 r \ -ijt/6 \ \ 111 

(5) sin 2x dx = \ -5 cos 2x\ = -5 cos(jt/3) + 5 cos(0) =-4 + 5 = 4 
Jo L Jo 

See the sketch above. Notice that this isn't twice (3)! If you had any trouble with these 
sketches, you should go back to Section 5.C.(a). 
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(6) 



f cosh x dx = [sum xl = sinh(l) - sinh(O) = 1.175 - 0 = 1.175 to 3 d.p. 
Jo L J 0 




using a calculator to evaluate sinh(l). The sketch is on the previous page. If you need 
help with this, go back to Sections 8 D.(a) and (b). 



(7) 



rji/4 r 3 n jt/4 3 3 3 

3 cos 2u du = 2 si n 2u\ = i sin(jr/2) - i sin(O) = i 




(9) 



(8) 



e 2x dx = [5 e 2 *]^ = \ e 4 - \e 2 = 23.6 to 1 d.p. 
J' e 3x+l dx = [je 3l+1 ]^ = |e 4 -|e 1 = 17.3 to 1 d.p. 





(10) 
(12) 
(14) 
(16) 
(17) 



f(t 2 + 2t)dt = \t 3 + t 2 + C (1 1) / sin 3x dx = -f cos 3x + C 

/ cos (2x + nil) dx = \ sin (2x + n/2) + C (13) / sinh 2u du = \ cosh 2u + C 

f e 5x dx = \e 5x + C (15) J e 2x + 3 dx = \e 2x + 3 + C 

x = / (St 3 + t)dt = 2t 4 + 5 t 2 + C, but x = 2 when f = 0 so C = 2. 

1 13 

x = / sin 2t dt = -5 cos 2t + C, but x = 1 when f = 0 so -5 + C = 1 and C = 5. 



(18) x = / (f + e 2 ') A = \ t 2 + 5 e 2 ' + C, but x = 0 when t = 0 so \ + C = 0 and C = -\. 



Thinking point after Exercise 9.A.3 

In graph (c), a = 0 when t = f. At this instant, the particle's acceleration is zero. It changes from 
negative (the particle was under some kind of decreasing drag) to positive. (The particle is now 
receiving some kind of increasing forwards push.) 

This is reflected in graph (b). We see the particle's velocity decreasing up to t = 5 and then 
increasing again. The curve has a negative slope up to t = 3 (dv/dt is negative), zero slope at t = 5 
(dv/dt = 0), and a positive slope when t > 5 (dv/dt is positive.) 

Graph (a) shows how this affects the distance travelled. 

We have d 2 x/dt 2 = dv/dt = a = 0 at t = 5. The graph shows how the distance is being covered 
increasingly slowly as the time gets closer to t = 5, and then faster again as the velocity picks up 
afterwards. The flattening in the graph (which gives the point of inflection where the tangent 
crosses the curve), shows that the change in the change of distance (that is, dldt (dxldt)), is 
happening increasingly slowly up to t = 3. It is zero at t = 3 and then becomes positive afterwards. 
(Remember that d 2 xldt 2 is the same thing as dv/dt.) 

You could imagine that the particle puts its brakes on at t = 0, gradually raising the pedal until 
t = 3, when both its feet are instantaneously off the pedals. After t = j, it gradually increases its 
pressure on the accelerator, and vrooms off along the positive x-axis. 

If we know what both x and v are when t = 0, we can move both forwards (differentiating), 
and backwards (integrating), between these particular curves. If we don't have this specific 
knowledge then moving backwards from (c) would give v = 3t 2 - 2t + C and x = t 3 - t 2 + Ct + K 
where C and K are constants representing the unknown amount of shift from the horizontal axis 
in both (b) and (a). 

Exercise 9.A.4 

(1) In |x + 3| + C (2) 21n \t -4| + C (3) f In |3x + 2| + C 

(4) -\ In |5 - 2x| + C (5) -4 In |3 - t\ + C (6) -\ In \5t + 4| + C 

(7) [-5 In |5 - 2x|]?! = -\ (In 1 - In 7) = 0.973 

4 

(8) This one can't be done because x = 3 (which gives a zero on the bottom of the fraction) lies 
between the limits of 1 and 3. 

(9) [\ In |3 + 2x|] Z 2 4 = \ (In |- l| - In |- 5|) = \ (In 1 - In 5) = -0.805. 
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Exercise 9.B.1 

(A) (1) 8? (2) \x m (3) 2e 2t (4) 5e 5x + 2 (5) -3e- 3 ' 

3 1 1 -11 -5 5 

(6) - = - (7) (8) = (9) — — = — — 

it t 3+x 2—x x-2 3-5? 5? -3 

(10) 3 cos 3? (11) -sin(x + 7t/3) (12) 2 cos (2x + nl6) 

(13) 2 sinh (2t + 5) (14) -5 cosh (8 - 5x) (15) 3 sin 2 x cos x 

(16) sec 2 x 

(B) (1) \ x 4 + C (2) | t y2 + C (3) | e 3 ' + C (4) 5 e 2x + 3 + C 

(5) -\e- 2t +C (6) -je 3 - 4 '+C (7) ln|x + 2| + C (8) 5ln|3x + 2| + C 

(9) -In |1 - x\ + C (10) -5 cos 5t+ C (1 1) sin (x + jr/6) + C 

(12) -5 cos (2? + jt/4) + C (13) I sinh 3x + C (14) 5 cosh (2x + 3) + C 

(15) sin 4 x+C (16) \ tan 2x + C 

(C) (1) [f w 3 + f k 2 ] 2 , = f (2) [f; 3 + ln?] 3 =f + ln3 

(3) [5 sin 4^]J /8 = \ (sin tt/2 - sin 0) = \ 

(4) [-5 cos 2x]q /6 = -\ cos tt/3 + \ cos 0 = -4 + 5 = 4 
(Remember cos 0 t- 0.) 

(5) [-2 cos (2x + tt/3)]J /2 = -\ cos (4jt/3) + \ cos (w/3) = 3 + 5 = 5 

(6) [j e 5t ] l _i = 5 (e 5 - e- 5 ) = 29.68 to 2 d.p. 

(7) [5 e 2 ' +3 ]°! = \ (e 3 - e 1 ) = 8.68 to 2 d.p. 

(8) {\ cosh 2x]l! = 0 

(You can see it will be this from Section 8.D.(a) and Section 9.A.(d).) 

(9) [tanx]^ /4 =l (10) [5 In \2u + 3|]^ = \ (In 9 - In 3) = 0.55 to 2 d.p. 

(11) [In |x + 2\ + 2 In |2x + 5|] 0 = In 3 + 2 In 7 - In 2-2 In 5 = 1.08 to 2 d.p. 

(12) [-In |2 - x\ - \ In |2x + 3|]li = - In 1 - 5 In 5 + In 3 + \ In 1 = 0.29 to 2 d.p. 

I'll give you the answers to each of the thinking point integrals when we look at the technique 
for that particular one. 



Exercise 9.B.2 



(1) Let t = 2x 2 - 3 so dtldx = 4x so we can replace 2x dx by 5 dt. Then 



2x 



dx = 



dt 



(2x 2 - 3) 4 

(2) Let t = 2x 3 + 9 so dtldx = 6x 2 . 



+ C- 



-1 



6(2x 2 - 3) 3 



+ C. 



Also, if x = 0 then ? = 9 and if x = 2 then f = 25. So 



6x 2 



dx = 



r 25 ^ 






2t m 







25 



V2x 3 + 9 

(3) Let * = x - 4 so = 1 and x = t + 4. 

Also, if x = 4 then f = 0 and if x = 5 then t = 1. So 



= 10 - 6 = 4. 



f 5 x(x-4) 9 Jx= f' / 9 (/ + 4) dt= \t u l\ \ +(4? lo )/10l 1 = 27/55. 

J4 JO L Jo 
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Exercise 9.B.3 

(1) I = In |x 2 + 3x + 9| + C (2) I = In |3x 2 + 2x + l\ + C 

(3) 1=2 In |x 2 + 4x + 7| + C 

(4) Let u = 3x 2 + 4 so duldx = 6x. Then 
du 

u 3 " V -2 



M — = S — +C = 4(3x 2 + 4)- 2 + C. 



(5) Let h = x 3 - 1 so <iw/dx = 3x 2 . Then 

(6) Let « = 1 - 4x 2 so Jm/Jx = -8x. Then 

! -^7 = -i« 1/2 + C = - 4 i Vl -4x 2 + C. 
u 



(7) Let u = 1 - x 2 so d«/<ix = -2x. Then 

I=-\\ U 1 ' 2 du = -i (4-1 - "I (1 " ^ 2 ) 3/2 + C. 



2 



(8) Let w = x + x 2 so Jw/dx = 1 + 2x. Then 

/ = \^L = 2u m + C = 2v(x + x 2 ) + C. 
J \'w 



(9) Let u = 2x-l so Jw/Jx = 2. Then 
l C du j -1 



+ C. 



w 2(2x - 7) 

Exercise 9.B.4 

(1) / = -\ cos 6 x + C (2) I=-\ cos 4 2x + C (3) 7 = \ sin 3 (x + n/4) + C 

(4) Let u = cos 3x so duldx = -3 sin 3x and -f t/w = sin 3x Jx. Then 

j f , 1 

/ = - = 5»- 3 + C = — 1— + C. 
J « 9 cos 3x 

(5) Let m = 1 + sin x so duldx = cos x. Then 



du , 1 

7 = | — = -k u- 2 + C = + C. 

w 3 " 2(1 + sin x) 2 



(6) Let u = sin x so <3?w/<fe = cos x. Then 

f du , , , , 

7 = — = In |w| + C = In |sin x\ + C. 

J u 

(7) I = \ j (cos 2x + 1) dx = \ sin 2x + \ x + C using cos 2x = 2 cos 2 x - 1. 

(8) cos 6x = 1 - 2 sin 2 3x (using cos 2J = 1 - 2 sin 2 A) so sin 2 3x = \ (1 - cos 6x), therefore 

7 = | J 1 - cos 6x cfc = 5 (x - 5 sin 6x) + C. 

(9) J cos 3 x dx = j cos x (1 - sin 2 x) dx = sin x - \ sin 3 x + C. 

(10) I = j cos 5x (1 - sin 2 5x) dx = \ sin 5x - j$ sin 3 5x + C 

(11) 7 = | j (cosh 2x + 1) dx = \ sinh 2x + \ x + C 

using cosh 2x = cosh 2 x + sinh 2 x = 2 cosh 2 x - 1 from Section 8.D.(d). 
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(12) 
(13) 



7=5 cosh 5 (x + 1) + C 

If t = sin x then <7//dx = cos x. 

If x = jt/6 then * = \ and if x = 0 then t = 0, so 

1/2 3 A f m 3dt 

1 = 



1/2 



-3 3 
— + 



, n t 2 + 5t+6 J„ (f + 3)(f + 2) J„ \; • 3 / + 2 
so we have 1 = [-3 In |f + 3| + 3 In \t + 2|]i /2 = 0.207 to 3 d.p. 



dt 



(14) 



If t = cosh x then dtldx = sinh x, so 
1 dt f / 2 

7 = 



1 



It 2 + 5f- 3 J \ 2/ — 1 ? + 3 
= In |2f- 1| - In \t + 3| + In J 



= In 



^ (2/ - 1) 



t + 3 



= In 



v4 (2 cosh x - 1) 



cosh x + 3 



(15) If x = 2 sin ? then dxldt = 2 cos t. 

If x = 1 then sin t = 5 so t = ji/6. If x = 0 then sin / = 0 so t = 0. 



2 cos ? 



V4 - 4 sin 2 f 



2 cos 7 
2 cos t 



dt 



; tt/6 



(16) 



If x = sin f then dxldt = cos t 

If x = 5 then t = jt/6 and if x = 0 then 7 = 0. 



rn/6 

Jo 



COS t COS £ fifr : 



(cos 2t+ 1) dt 



(17) 



(18) 



= 5 [5 sin It + dJT = 2 [V3/4 + tt/6] = 0.478 to 3 d.p. 

Let / = 4 + sin x so A/cfc = cos x. 

If x = 7t/2 then t = 5 and if x = 0 then t = 4. 

/ = J 5 lit 2 dt = [- = -5 + 5 = ^ 

If x = 2 cos ? then Jx/A = -2 sin 

If x = \ 2 then ? = it/4 and if x = 1 then t = jt/3. 



nIA 



2 sin t 



JT/3 



(4 cos 2 0(2 sin t) 



dt = 



•jr/3 
1 I .2 



sec ? 



4 



jr/4 



tan ? 



=hV3-i) 



Exercise 9.B.5 

(1) 'a' = 5 so 7=3 sinh" 1 (x/5) + C. (2) 'a' = 3 so 7 = 2 shr 1 (x/3) + C. 



(3) 'a' = 4 so 7= ti tan" 1 (jc/4)]|j = 



16 



0 



16- 



Exercise 9.B.6 

(1) 7 = 5 tan- 1 (f/3) + C 



(2) I- 



dx 

(x + 3) 2 + 1 



so 'x' = x + 3 and l a' = I, giving tan 1 (x + 3) + C. 
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f 2x + 4 f dx 

(3) J = — dx + 



x 2 + 4x+ 13 J x 2 + 4x+ 13 
<ix 



In | x 2 + 4x + 13| + 



(x + 2) 2 + 9 



= In | x 2 + 4x + 13| + 5 tan" 1 I — ^— I + C 

f At + 16 - 13 f 2? + 8 f A 

(4) /= * = 2 #-13 

J ? 2 + 8?+17 ] t 2 + 8t+n ) (t+4f+\ 

= 2 In |? 2 + 8? + 17| - 13 tan- 1 (/ + 4) + C 



(5) 1=1 — g- = , , g 

J 4(x 2 +|) 4 J x 2 + l 

\ X f tan- 1 (2x/3) + C = | tan" 1 (2x/3) + C with 'a' = f 



(6) / = 



3Jx j f dx 



3 | 2 . 16 



9(* + f ) J x 2 + 
X | tan" 1 (3x/4) + C = j tan" 1 (3x/4) + C with V = f 



6m + 4 j f 6w + 18 - 14 

(7) / = j = J du 

u 2 + 6u + 13 w 2 + 6m + 13 



2w + 6 

du -7 



u 2 + 6m + 13 J (m + 3) 2 + 4 

m + 3 s 



7 /m + 3 
In |m 2 + 6m + 13| - j tan" 1 I — ^— 



+ C 



3du 

(8) / = I so 'x' = 3m + 4 and V = 1 so Jx/Jm = 3. 

1 (3m + 4) 2 + 1 

f dx 

This gives / = = tan" 1 x + C = tan" 1 (3m + 4) + C 

J x 2 + 1 

(9) I will do this one using the second method I gave you where we put up with the fractions in 
order to use the tan -1 rule directly. We have 



i 3du 3 f du 



1 M + M + J J (m + j) + 4 

Now we put m + 5 — x and | = a which gives us 
M + ^\ / 2m + 1 



/ = 2 X 5 tan" 1 I —3— ) + C = tan" 1 ( — - — ] + C 

You might like to look at questions (8) and (9) being worked out the opposite way round in 
(2)(d) and (2)(i) of Exercise 8.D.3. 



Exercise 9.B.7 

(1) /=| ( — — I dx = 41n |x + 2| - 41n |x + 3| + C 

J \x + 2 x + 3 
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= I In |x - 2| - | In |x + 3| + + C 

x + 3 

(3) This one often causes problems although it looks so simple. The important thing to notice 
about it is that it counts as top heavy, so we say 

x- 1 f x + 1 -2 f 2 

dx = | dx= \ 1 dx = x - 2 In |x + 1 1 + C 



x + 1 



X + 1 



X + 1 



(4) | 4 , 4V . ) dy 4 In M 2 In [r 1 1 C 



(5) 1 = 



1 



2t-\ t + 2 (t + 2f 



dt 



\ In \2t -\\-\\n\t+2\- + C 



(6) / = 



1 y-9 



t + 2 



dy = ln\y-l\ 



y-\ y 2 + 9/ ' J J 2 + 9 

In [y - 1| - 5 In [y 2 + 9| + 3 tan" 1 (y/3) + C 



y I 9 

— -<fy 



y + 9 



(7) / = 



x — 1 x + 3 x — 3 



dx 



= -| In |x - 1| - f In |x + 3| + f In |x - 3| + C 



(8) I 



t 2 + 1 
? 2 - 1 



1 + 



dt 



t 2 - 1 + 2 
/ 2 - 1 



1 + 



/ 2 - 1 



f- 1 t+ 1 



(9) / = 



x 2 + 1 
x 2 + 6x + 8 

1 + — 



dx 



<#= ? + In \t- 1| - In \t+ 1| + C 



x 2 + 6x + 8 - 6x - 7 f (6x + 7) 

Jx = 1 - — dx 



x 2 + 6x + 8 



x 2 + 6x + 8 



17 

2 1 dx = x + f In |x + 2[ - y In |x + 4| + C 



x + 2 x + 4 



Don't forget the 1 when you rewrite the integral after finding the partial fractions. 



(10) First, factorise y 3 - y 2 as y 2 (y - 1). Then you will find 
I 



(11) I- 



11 1 , 

- - — + ] dy = - In |y| + My + In \y - l| + C. 

y y y - i , 



2 1 3 

M M 2 2u + \ 



du = 2 In |m| + 1/m - | In |2« + l| + C 
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(12) Let / = e x so that dtldx = e x . Remembering that e 2x = (e x ) 2 , we can then rewrite I in the 
form 

7= | = I ( — *— - — *— | dt = In If + 2| - In If + 3| + In A. 

J f 2 + 5/ + 6 J \f + 2 f + 3/ 

Now we can combine this answer into a single log, and we must also replace t by e x . 
Doing this gives 

(e x + 2\ 
I = In A\ . 

\e x + 3/_ 

If you had any difficulties finding any of these partial fractions, you will find that all of them 
come in the answers to the exercises of Section 6.E. 

Exercise 9.B.8 

(1) (a) duldx = 5e 5x (b) duldt = 4cos At (c) v = \ e 4x (d) v = -e~' 

(e) duldx = 2/2x = l/x 

(Watch out for this one! The usual mistake is to say it is l/2x.) 

(f) v = \ cosh 2f (Section 8.D) (g) v = -f cos 3x (h) duldx = -2 sin 2x 

(2) I=J 2t sin 3f dt 

Let u = 2t so duldt = 2 and dvldt = sin 3f so v = -\ cos 3f. 

2 2 2 2 

This gives / = -5 / cos 3f + 5 / cos 3f df = -p cos 3f + 5 sin 3f + C. 

(3) 7 = / xe x tfx 

Let m = x so fi?M/dx = 1 and Jv/cfc = e x so v = e x . 
This gives / = xe x - J e x dx = xe x - e x + C. 

(4) 1= J x 2 sin x dx 

Let u = x 2 so duldx = 2x and dvldx = sin x so v = -cos x. 

This gives / = -x 2 cos x - / -2x cos x dx = -x 2 cos x + 2 / x cos x (fx. 

For 7j = / x cos x dx, we let u = x so duldx = 1 and fi?v/cfc = cos x so r = sin x. 

This gives / = -x 2 cos x + 2(x sin x - / sin x dx) = -x 2 cos x + 2x sin x + 2 cos x + C. 

(5) (a) 7 = J 2 x 2 In x dx 

Let w = In x so dw/cfo = l/x and dvldx = x 2 so v = \x 3 . 

This gives I = [| x 3 In x] 2 - J 2 5X 3 . 1/xdx = | In 2 - | [x 3 /3] 2 = | In 2 - | 

(b) / = Jl l/x In x dx 

Let w = In x so dw/dx = l/x and dvldx = l/x so v = In x. Doing this gives 

/ = [(In x) 2 ] 2 - J 2 l/x In x dx = [(In x) 2 ] 2 - 7 so 7 = 5 (In 2) 2 . 
This one can also be done by substitution. 

If we let In x = t then dtldx = l/x and 7 = /|, n2 t dt = [\ t 2 ] ] ^ 2 = \ (In 2) 2 as before. 

(c) / = J 2 x 2 (In x) 2 dx 

Putting u = (In x) 2 and dvldx = x 2 gives duldx = 2/x In x and v = | x 3 . 
This then gives 

^ = [5 x3 ( m x ) 2 ] 1 _ Si 5 x3 - 2/x In x dx = § (In 2) 2 - | J 2 x 2 In x dx 
= f (In 2) 2 - I (| In 2 - |) from (a), giving / = f (In 2) 2 - y In 2 + ^. 
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(6) I 



s: 



tan x dx 



Let u ~ tan x so Jm/Jx 



I 



This gives 7" = 



jx tan x 



1 +x 
r 1 

1 



and dvldx = x 2 so v = | x 3 . 



0 •'o 



1 +x 



1 , , f 1 x 3 

dx = 3 (jr/4) - 3 



1 +x z 



dx. 



This fraction is top-heavy so we cunningly rewrite the top as x(l + x 2 ) - x. This then 
gives us 



I=n/\2- 



1 +x 



Jx = tt/12-3 



x 2 /2-jln(l +x 2 ) 



= 7t/12-£(l-ln2). 



(7) 7" = Jo e x sin x dx 

If we let u = e~ x and dv/dx = sin x, we have dtddx = -e~ x and v = - cos x. This then 
gives us 

["1 71 rjz rn 
e~ x (- cos x) - (- cos x)(-e~ x ) dx = e~ n + 1 - e~ x cos x dx. 
Jo Jo Jo 

Using integration by parts again on this second integral gives 

I=e~ n +\- i^e~ x sin x]* - J" - e~* sin x dx} = e~ n + 1 - 7 
so 7=| (e" 51 + 1). 

(8) I = S e~' cosh 2t dt 

I'll let w = e~' so dtdd? = -e~* and dv/d? = cosh 2t so f = 5 sinh 2t. 

11 

This gives / = 2 sinh 2f + 5 / e~* sinh 2t dt. 

Now I integrate / e~' sinh 2/ dt by parts, and I must have u = e~' and dvldt = sinh 2? so 
duldt = -e~' and v = 2 cosh 2?. This then gives me I = 2 e~ l sinh 2t + 2 (2 cosn 2/ + 2 7) 
so 7 = I e~* (2 sinh 2* + cosh 2t) + C. 

(9) 1= J sin 4x sin 3xdx = \ /(cos x - cos 7x) dx = 5 (sin x - ^ sin 7x) + C using 
sinJ sin B = 2 [cos (A - B) - cos (^4 + 73)]. 

(10) 7 = / cos 3t cos 6/ dt = / cos 6/ cos 3f df 

I've swapped the order here to avoid minus signs. I can do this because the 
multiplication is the same either way round. This then gives me 

7=5/ (cos 9t + cos 3/) dt = (sin 9t + 3 sin 37) + C 

using cos y4 cos B = \ [cos (^4 + 73) + cos (^4 - 73)]. 



Exercise 9.B.9 

fjr/2 



rjr/2 rjr/2 _ . 

(1) (a) cos" x dx = cos x cos" x 
Jo Jo 



dx 



Let m = cos" 1 x so dzddx = -(n - 1) cos" 2 x sin x and dvldx = cos x so i> = sin x. 
Using integration by parts gives 



so 



[_ . "I Till fJl/Z _ ? 

cos" x sin x + (n - 1) sin x cos" x dx 
Jo Jo 

7 = 0 + (« - 1) p /2 (1 - cos 2 x) cos"- 2 xdx = (n- 1)(7._ 2 - 7J 
Jo 



« - 1 

so «/„ = (n - 1) 7„_ 2 and 7„ = 7„_ 2 . 



rTT/2 r iji/2 rTtll r n jr/2 

(b) I 0 = \ dx = x = tt/2 and 7, = cos x dx = sin x = 1 
Jo L Jo Jo L Jo 

1 3 22 

h = 2 ^0 = nl4 and 7 4 = 4 7 2 = 3tt/16. 7 3 = 3 7 X = 3. 
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(c) The answers are the same as Example (1) because the curves forj = sin" x and 
y = cos" x from 0 to jt/2 are mirror images of each other in the line x = jt/4. I show 
this for the sin x/cos x and sin 2 x/cos 2 x pairs in the graphs below. 





-yx 



T 4 for* each 



(2) (a) [ sin nx dx = dx [-1/n cos nx\ = -1/n [cos nn - cos(-«jt)] = 0 
because cos(-wjt) = cos nn since the cos graph is even. 

r 71 r l)t 

(b) J cos nx dx = ^1/n sin nxj = 1/n [sin «x - sin (-wr)] = 1/n [0 - 0] = 0 



rjl ~ 1 rjr l r A "1 71 

(c) J sin 2 nx dx = 5 J (1 - cos (2nx)) dx = j \ x - — sin (2nx)J 

= tt/2 + nil = ji 
(See Example (4) in Section 9.B.(c) for the method here.) 

f 71 I p 

(d) (i) sin nx cos mx dx = 5 sin [(« + m)x] + sin [(« - m)x] dx 

J-n J-Ji 

(see Example (8) in Section 9.B.(f) for the method here.) 



1 



cos (« + m)x 



1 



(n + m) (n - m) 

because cos (n + m)x = cos [-(« + m)x], and the same for (n - m), 

(d) (ii) If n = m, we have 

/ = sin nx cos nx 

J -JT. 



cos {n - m)x 



1 r 77 

dx = 5 sin 2nx dx 

J -71 



1 r 1 i 71 
= 2 -2H cos 2«x = 0. 

This uses sin 2A = 2 sin A cos A from Section 5.D.(d). 

(3) Let /„ = j cosh x cosh" - 1 x dx because we can easily integrate cosh x. 

Now put u = cosh" - 1 x so duldx = (n — 1) cosh" -2 x sinh x, and dv/dx = cosh x so 
f = sinh x. This gives 

4 = (cosh" ~ 1 x) (sinh x) - j (n - 1) sinh 2 x cosh" ~ 2 x dx 

= cosh" - 1 x sinh x - (n - 1) J cosh" -2 x (cosh 2 x - 1) dx 

using cosh 2 x - sinh 2 x = 1 from Section 8.D.(a), so 

/„ = cosh" - 1 x sinh x - (n - 1) /„ + («- 1) /„_ 2 
and, rearranging, we have n/„ = cosh" - 1 x sinh x + (n-l)/„_ 2 . 
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(4) We say 



4 = J sec 2 x sec " 2 x dx, 

splitting the integral in a similar way to Example (3). But this time we do use integration by 
parts. 

We let dvldx = sec 2 x so v = tan x and u = sec" -2 x. Then 

duldx = (n - 2) sec" ~ 3 x sec x tan x = (n - 2) sec" ~ 2 x tan x 

because d/dx (sec x) = sec x tan x (Section 8.C.(e)). This gives us 

I„ = sec" ~ 2 x tan x — J (n — 2) sec" ~ 2 x tan 2 xg!x. 

Now we use the identity tan 2 x + 1 = sec 2 x to get the integral all back in terms of sec x. 
This gives us 

I„ = sec" ~ 2 x tan x - (n - 2) J (sec" x - sec" ~ 2 x) cfc 

= sec" -2 x tan x - (w - 2) I„ + (n - 2) 4-2 
so (n - 1) 4 = sec" -2 x tan x + (n - 2) 7„_ 2 . 
Now, 

7 6 = | sec 6 Jx and 5/ 6 = sec 4 x tan x + 4I 4 , 
4/4 = 4X5 (sec 2 x tan x + 2I 2 ) and 7 2 = j se c 2 x dx = tan x 
so, tidied up, we have 

7 6 = 75 tan x (3 sec 4 x + 4 sec 2 x + 8). 
Finally, tan(7t/4) = 1 and tan 0 = 0 and sec(jr/4) = V2 and sec 0 = 1 , so we have 

rjr/4 



J" sec 6 x clx = (1) (12 + 8 + 8) - 0 = ff. 



Exercise 9.B.10 

In these answers, I shall quote the results we have just found in this section. You might need to 
show these yourself for exam purposes. 

M / 1 \ 2dt 

(1) This becomes 



0 \ 



2t \ fl-t 2 
2 + 3 I rl + 2 



1 + V 



\+t 2 



2dt 



0 2(1 + t 1 ) + 6t + 2(1 - t 2 ) } Q 3t + 2 



l+t 2 JI 

(using t = tan ill A = 1 if x = nil, and t = 0 if x = 0) 

1 dt , 

I In |3< + 2| 



= j (In 5 - In 2) = 0.305 to 3 d.p. 



(2) This becomes 



f 1 / 



1 



1 - f 

0 \ 1 + lTT7 2 // 



2 a 
1 + f 



dt (nice!) 



L Jo 



= 1. 
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(3) This becomes 



o \ 



2<ft 



2 + 



r + 4? + 3 



1 -t 2 
TT7 2 

V3 



0 



+ 2 



2/ 



1 +' 2 // 



t + 3J \t+ 1 



2<ft 

77T 2 



1V3 



= [- In |f+3| + In |? + 1|] = In (V3~ + 1) - In (vl + 3) + In 3 



In 



3(1 + V3) 

3 + \/3 



In 



3(1 + V3) (3 - V3) 
(3 + V 3) (3 - V 3) 



= In (V3). 



Exercise 9.C.1 

(1) j y dy = j cos x dx so \y 2 = sin x + C. 

Putting y = 1 when x = Jt/6 gives C = 0 so we have y 2 = 2 sin x. 

(2) j \lx dx = J 3 * so In |x| = 3? + C. 

If we let C = hi A, we get In \x/A\ = 3t so xA4 = e 3t and x = Ae 3t . 
Putting x = 1 when t = 0 gives ^ = 1 so x = e 3t . 

(3) J 1/y 2 dy = j 1/x 2 Jx so -My = -\lx + C or My + C = Mx. 

Putting y = \ and x = 1 gives C = -1. 

(4) j y 2 dy = j x 2 dx so ^y 3 = |x 3 + C or y 3 = x 3 + K where K = 3C. 

Putting j = 2 when x = 1 gives K=l. 

(5) j ^ 2 ^ - j dy = j — so In [y 2 + l| = In |x| + hi A = In H- 

From this, y 2 + 1 = ^x. Putting y = 0 and x = 1 gives .4 = lsoy 2 =x-l. 

(6) j Mydy = 2 j so In [y| = 21n |x - 5| + In ,4 = In |^(x - 5) 2 | so y=A(x- 5) 2 . 

Putting j = 2 when x = 4 gives ^4 = 2. 

f f cosy f 1 1 9 

cot y dy = dy = x dx so In |sin y\ = 5 x + C. 

J J siny J 

Putting y = jt/2 when x = \'2 gives C = -1. 

I — — = I sec 2 0 c?0 = I 7 dr so tan 0 = In |r| + In ,4 = In \Ar\. 
J cos 2 0 J J 

Putting r = 2 when 0 = 0 gives In 2A = 0 so 2^4 = 1 and A = \. 



(7) 



(8) 



(9) You need to be a bit crafty here, and say 

e x+y = e x e y so j e y dy = j xe~ x dx. 
Using integration by parts gives 

e y = -xe~ x + j e~ x dx so e y = -xe~ x - e~ x + C. 
Putting y = 0 when x = 0 gives C = 2. 

d y f . . . _, , 1 

= x + 

Putting y = 0 when x = 0 gives C = 0. 



(10) j -j j = J(l+x 2 )cfc so tan" 1 y = x + 5X 3 + C 
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Exercise 9.C.2 

dc [ dc f 1 

(1) (a) - — = kc 2 so - — = \ k dt so - = kt + A 



dt J c 2 J c 

where A is a constant, c = c 0 when f = t 0 so ^4 = l/c 0 . 

dx f dx f 

(b) — = k(a - x)(b - x) so = k dt. 

dt J (a - x)(b - x) J 

1 P Q 

= + to find partial fractions. 

(a - x)(b - x) a - x b - x 

so 1 = P(b - x) + Q(a - x). 

1 1 

Putting x = b gives Q = and putting x = a gives P = = -Q 

a - b b - a 

1 1 / -1 1 

so = + 



(a - x)(b — x) a — b \a - x b-x 
Now we can say that 

1 f / -1 1 

+ I dx = k I dt 



a — b J \a — x b-x 

so {~~^ I ( ln ( fl ~ x ) ~ ln ( b ~ x )) = kt+ C. 

(We know physically that a > x and b > x.) 
Using the second law of logs gives 

1 



ln \ = kt+C. 

a - b ) \b — x ) 

( 1 

x = 0 when t = 0 so C = ] ln (alb), so we have 

\a - b ) 



K a — bj\ \b — x j \b ) ) \a — b j \a(b—x), 
which takes some getting through but it's all right if you do it slowly and steadily! 



(2) We start with dvldt = a so j dv = j a dt so v = at + C. 



We know that v = u when t = 0 so C = u and we now have (a) v = u + at. 
(The integration here is so straightforward that you don't have to write in the step with 
the integral signs unless you want to.) 

Now, v = — = u + at so J ds = \ (u + at) dt so s = ut + \ at 1 + K. 
dt j j 

Since s = 0 when t = 0, we have K = 0 and this gives us (b) s = ut + \ at 2 . 

v - u 

From (a) we have a = . Substituting this in (b) gives 

t 

! Iv - u\ j j lu + v\ 

s = ut + 2 r I j = + 2 ^ - 2 M? = I — ~ — I ( - ( c ) 

To get equation (d), you can substitute for t in (c) using (a). This gives 
u + v \ I v — u 



so 



las = v 2 - u 2 or v 2 = u 2 + las. 
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You can also get (d) by using the Chain Rule to say dvldt = dvlds dsldt = v dvlds. From this, 

dv dv [ [ 1 

— = v — = a so v dv = a ds so j v as + C. 
dt ds J J 

v = u when s = 0 so C = \ u 2 and we get (d) v 2 = u 2 + las. 

It is quite likely that these four equations of motion will be very familiar to you. They 
describe the simplest case of what happens if a particle is accelerating, which is when this 
acceleration is constant. 




They are only true if the acceleration is constant! 

We've now done enough other examples for you to know that, if the amount of 
acceleration depends, for example, on the length of time you have been travelling, 
then you will get very different equations for the velocity and distance after a time 
of t. 



One reason why the four equations above are physically useful is that they describe the 
motion of a body which is falling freely to the ground because of gravity, provided that we 
think that air resistance will have too slight an effect to matter. Whether this is true will 
depend on the type of object falling; a feather is designed to behave differently from a 
stone. (If they were falling in a vacuum, their behaviour would be identical; physics will tell 
you why this amazing thing is so.) I used the second of these equations of motion in the 
example of the ball thrown up in the air in Section 2.D.(g). In this particular case, a = -g 
because we are taking the upwards direction as positive, so the acceleration due to gravity is 
negative since it is downwards. 

If the acceleration is constant, we can see particularly nicely how the gradients and areas 
relate to each other when we compare the graphs of velocity against time and distance 
against time. 

I show these two graphs below. 




In (a), you can see that the acceleration (which is the rate of change of velocity with 
time) is given by the gradient of the line. This means that 
v — u 

a = so v = u + at. 

t 

The area under the line in graph (a) is given by adding the rectangle and the triangle, so it 
is ut + \ t(v - u) = ut + \at 2 which is the same as the distance s. 

The gradient of the curve of the distance-time graph shown in (b) gives us the rate of 
change of distance with time, that is, the velocity v at any time t. We have s = ut + jat 2 so 
dsldt = u + at = v. 

These two graphs have the same backwards and forwards links between them as the two 
graphs I used in Section 9.A.(c) to show you how differentiation and integration are the 
reverse processes of each other. 
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dv f dv f 1 

(3) — = g - kv so = dt so - — In \g - kv\ = t + C. 

dt ] g- kv ] k 



v = 0 when / = 0 so C = In g and we have In g - In \g - kv\ = kt 

k 



so In 



g- kv 



= kt so 



g- kv 



= e 



and ge = g - kv. 



This gives v = glk (1 - e~ kt ). 

Since e~ kt gets closer and closer to zero as t increases, v gets closer and closer to what is 
called its terminal value of glk. 



dv 

(4) We have v — = g - kv so 

dx 



vdv 
g-kv 2 



dx 



so -l/(2k) In \g - kv 2 \ = gx + C. Now, v = 0 when x = 0 so C = -\/(2k) In g 



so 



1 1 

v (In g - In \g - kv |) = In 



(2k) 



(2k) 



kv 2 



We'll show in the second part of the question that g is always greater than kv 2 , so we can 
use ordinary brackets for this log since the expression inside the brackets is always positive. 

The second part of this question starts by asking us to write v in terms of x. We have just 
shown that 



2kx = In 



g-kv 2 



so e 2kx = 



g 



so g - kv 2 = ge 
This means that 

v = {glk (1 - 



g-kv 2 
and v 2 = glk (1 - e - 2kx ). 



e- 2kx ). 



As x increases, e~ 2kx gets closer and closer to zero, so v gets closer and closer to the 
terminal velocity of V glk. It never actually becomes equal to it (this is the limiting value), 
so it certainly never becomes greater than it, and this justifies the use of ordinary brackets 
for the log in the answer to the first part of this question. 

Notice that the two terminal velocities which we have found for v in the last two 
questions mean that such a drag force is physically possible. If v had been able to become 
any larger in either case, the net force on the particle would be pushing it back upwards. 
This would mean that the second force acting on the particle couldn't be due to air 
resistance or viscosity because physically these forces can only slow down motion. 
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2 ± x4 8 2 ± 2V-T 
(1) z 2 - 2z + 2 = 0 so z = = = 1 ±j 



4 ± Vl6 - 52 4 ± 6V-T 
(2) z 2 - 4z + 13 = 0 so z = = = 2 ± 3j. 



-4 ± Vl6 - 20 -4 ± 2V-T 
(3) z 2 + 4z + 5 = 0 so z = = = -2 ± j. 



-2 ± v'4 - 24 -2 ± 2V^5 r - 
(4) z 2 + 2z + 6 = 0 so z = = = -1 ± V 5j. 
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Exercise 10.A.2 

(1) (a) 1 3 1 = 3 and arg 3 = 0 (b) | -2| = 2 and arg(-2) = ji 

(c) \2j\ = 2 and arg(2/) = jt/2 (d) | - 5/| = 5 and arg(-5/) = -;t/2 

(e) |5 + 12/1 = 13 and arg(5 + 12/) = tan" 1 (y) = 1.18 radians 

(f) |- 5 - 12/'| = 13 and arg(- 5 - 12/) = -» + tan" 1 (y) = -1.97 radians 

(g) |7 - 24/| = 25 and arg(7 - 24/) = tan" 1 (-y) = -1.29 radians 

(h) |- 7 + 24/| = 25 and arg(-7 + 24/) = n - tan" 1 (y) = 1.85 radians 

(i) |- \'3 +j\ = 2 and arg(-\i3 + j) = n - tan -1 (-jf) = 5tt/6 radians 
0) h 1 -jl = v2 and arg(- 1 -j) = -jt + tan -1 (1) = -3jt/4 radians 

(2) All four roots have a modulus of 1 . 

arg root (a) = 0, arg root (b) = ;t/2, arg root (c) = Jt, and arg root (d) = -nl2. 

(3) For equation (1), z x = 1 + j and z 2 = 1 -/, so |zj = |z 2 | = \'2. 

arg zj = tan _1 (l) = jr/4 radians and arg z 2 = -Ji/4 radians, by symmetry. 
For equation (2), z x = 2 + 3/ and z 2 = 2 - 3/ so |zj = |z 2 | = \13. 

arg z 1 = tan -1 ( 2 ) = +0.98 radians, so arg z 2 = -0.98 radians by symmetry. 
For equation (3), z 1 = -2 + j and z 2 =-2-j so \z x \ = |z 2 | = \'5. 

arg Zj — jt — tan -1 (5) = 2.68 radians, so arg z 2 = -2.68 radians by symmetry. 
For equation (4), z x = -1 + \5j and z 2 = -1 - V5/ so |zj| = |z 2 | = \/6. 

arg zj = Jt - tan -1 \ 5 = 1.99 radians, so arg z 2 = -1.99 radians by symmetry. 

Exercise 10.B.1 

(1) 1+7/ (2) -6 + 5/ (3) 5 + 4/ (4) -10-7/' 

Exercise 10.B.2 

(1) (a) 9 + 6/ (b) 7/ (c) 1 (d) 6 + 23/ 

(2) (a) You should have (i) w = 3 + 4/ and \w\ = 5, (ii) w = 15 + 8/ and |w| = 17 and 

(iii) w = 1 + 24/ and |w| = 25. 

These give the three right-angled triangles with sides of lengths 3, 4 and 5 units, 8, 
15 and 17 units, and 7, 24 and 25 units respectively. 

(b) w = z 2 = (x + jy f = x 2 - y 2 + 2xyj so u = x 2 - y 2 and v = 2xy. 

If x and y are whole numbers, then \w\, u and v must also be whole numbers. (In 
case (iii), |z| is also a whole number.) 

(c) You should find that drawing the Argand diagram for w gives you a triangle with sides 
of 119, 120 and 169 units, which is almost isosceles. 

Drawing the Argand diagram for W gives you a remarkably long thin triangle with 
sides of 239, 28 560 and 28 561 units. 

Both triangles have sides given by whole numbers and, of course, fit Pythagoras' 
Theorem. 

(3) You will know if you have the right answers for this question since you are just checking 
what you have already done. 

I will go through one as an example in case you have any problems. 
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The roots of z 2 + 2z + 6 = 0 from question (4) of Exercise 1 O.A.I are -1 ± \ 5/. 
Putting z = -1 + \5J back into the equation gives 

(- 1 + v5/) 2 + 2 (-1 + Vfy) + 6 = 1 - 2V5y + 5/ 2 - 2 + 2x5/ + 6=1-5-2 + 6 = 0. 

The other root checks similarly. 

For this equation, a = 1, b = 2 and c = 6. 

The sum of the roots is 

(-1 + v5/') + (-1 -y[5j) = -2 = -bla. 

The product of the roots is 

(-1 + V5/X-1 x5/) I Jsj • o/ 5/ ; 6 da. 

(4) The three roots are symmetrically placed so they will look like the ones in my drawing 
below. 




Each root has a modulus of one and the arguments are 0, 2tt/3 and - 2tt/3. From this, 
root (1) = 1, 

/ (2n\ (2ji\\ 1 V3 

root(2)=l^cos^ T j+ i sm^ T jj=-- + T7 , 

( ( 2n\ ( 2ji\\ 1 ^3 

root (3) = 1 [cos (- —j +j sin ^- yjj = - - - —j. 

(These results use the sin and cos of Jt/3 from Section 4.A.(g) and the symmetry of the 
figure.) 

If root (2) = w, then \w X co\ = 1 since \co\ = 1, and arg(a> X a») = 2 arga> = 4tt/3. 
The principal value of this argument is -2jt/3, so we see that m X w = a> 2 or multiplying 
root (2) by itself gives root (3). 
Equally, we could say that 

/ 1 \ 3 \ / I x 3 \ 1 x 3 x 3 3 , 1 V3 

CO X w = - - + — j - - + — j = - - — j - — j + -f = - - - — j = (O 2 . 
\2 2/ \2 2/ 44 4 4 22 

Similarly, we can show that a> 2 X co 1 = co, using either of the above methods. 
The thinking point at the end of Exercise 10.B.2 is answered in Section 10.D.(a). 



Exercise 10.B.3 

The conjugates are all shown drawn in on this diagram. They are all reflections of each other in 
the real axis. Notice that this means that z 5 is its own conjugate. 
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Exercise 10.B.4 

The working for the first four questions is exactly the same as in the example given just before 
the exercise. For each one, you just multiply the top and the bottom by the conjugate of the 
bottom and then tidy up. 

The answers you should have are as follows. 

(1) 5(6-3/) (2) Ul2 + Sj) (3) j (4) -p(l + 13/) 

Notice that the answer to question (3) is the same as the result which we got by using the 
mod/arg form in Section lO.B.(c). 

2-7 i 

(5) Multiplying out the bottom gives . Simplifying this gives 55 (5 -j). 

11-3/ 

3(2 + j) 2(3 - 2j) 

(6) The first stage gives + 



Adding these gives 



5 13 
39(2 + j) + 10 (3 - 2j) 108 + 19/ 



65 65 
(See Section l.C.(c) if necessary.) 



Exercise 10.C.1 

(1) The working for each of these is the same as in the example which I have already given, so 
I shall just give you the three pairs of answers here. 

(a) 2+j and -2 -j (b) 4+j and -4 -j (c) - 3 + 2j and 3 - 2j. 



Each pair of roots forms a straight line on its Argand diagram. 
(2) The four Argand diagrams are shown below. 
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The working for each part goes as follows. 

(a) z 2 = 4j = [4, jt/2] in mod/arg form, so z x = [2, jt/4] and z 2 = [2, - 3tt/4]. 

(b) z 2 = 1 + ^3/ = [2, jr/3] so Zj = [^2, jt/6] and z 2 = [\'2, - 5nl6\. 

(c) z 2 = -V2 + ^2/ = [2, + 3jt/4] so z x = [V2, 3tt/8] and z 2 = [V2, -5tt/8]. 

(d) z 2 = -1 - V3/ = [2, -2jt/3] so zj = [Vz, -«/3] and z 2 = [v2, 2;t/3]. 

Exercise 10.C.2 

(1) cos 30 + 7 sin 30 = (cos 6+j sin 0) 3 

= cos 3 0 + 3 cos 2 0(/' sin 0) + 3 cos 0(/' sin 0) 2 + (j sin 0) 3 

(a) Equating the real parts gives 

cos 30 = cos 3 0 - 3cos 0 sin 2 0 = cos 3 0 - 3 cos 0(1 - cos 2 0) = 4 cos 3 0 - 3 cos 0. 

(b) Equating the imaginary parts gives 

sin 30 = 3 cos 2 0 sin 0 - sin 3 0 = 3 sin 0 - 4sin 3 0. 

(2) cos 70 + j sin 70 = (cos 0 + y'sin 0) 7 

= cos 7 0 + 7 cos 6 6(J sin 0) + 21 cos 5 6{j sin 0) 2 + 35 cos 4 6(j sin 0) 3 
+ 35 cos 3 0 (/' sin 0) 4 + 21 cos 2 0 (/' sin 0) 5 + 7 cos 0(/ sin 0) 6 
+ (/ sin 0) 7 
Equating the real parts gives 

cos 70 = cos 7 0-21 cos 5 0 sin 2 0 + 35 cos 3 0 sin 4 0 - 7 cos 0 sin 6 0 

= cos 7 0-21 cos 5 0(1 - cos 2 0) + 35 cos 3 0(1- cos 2 0) 2 - 7 cos 0(1 - cos 2 0) 3 
= cos 7 0-21 cos 5 0 + 21 cos 7 0 + 35 cos 3 0 - 70cos 5 0 + 35 cos 7 0 

- 7cos 0(1-3 cos 2 0 + 3 cos 4 0 - cos 6 0) 
= 64cos 7 0-112 cos 5 0 + 56 cos 3 0 - 7 cos 0. 

Exercise 10.C.3 

(1) cos 5 0 = (j) 5 (z + 1/z) 5 

= (j) 5 (z 5 + 5z 4 (1/z) + 10z 3 (1/z) 2 + 10z 2 (l/z) 3 + 5z(l/z) 4 + (1/z) 5 ) 
= (j) 5 (z 5 + 1/z 5 + 5(z 3 + 1/z 3 ) + 10 (z + 1/z)) 
= (j) 5 (2 cos 50+10 cos 30 + 20 cos 0) 
= (\f (cos 50 + 5 cos 30 + 10 cos 0) 

(2) (/'sin0) 5 = (i) 5 (z-l/z) 5 

= (j) 5 (z 5 + 5z 4 (-1/z) + 10z 3 (-1/z) 2 + 10z 2 (-1/z) 3 + 5z(-l/z) 4 + (- 1/z) 5 ) 
= (s) 5 (z 5 - 1/z 5 - 5(z 3 - 1/z 3 ) + 10(z - 1/z)) 
= (j) 5 (2j sin 50 - 10/ sin 30 + 20/ sin 0) 

so sin 5 0 = ( 2 ) 4 (sin 50-5 sin 30 + lOsin 0) since j' 5 =j. 
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Exercise 10.D.1 



I have used the general rule for finding roots for these answers, but your answers will be equally 
good if you have used the geometry of your sketches. In either case, these sketches make a check 
on your arithmetic because you can quickly get a good idea of whether your calculated roots fit 
with what you have drawn. 

(1) Let w — z 3 — 21 j so |w| = 27 and arg w = jt/2 + 2ht. Then 



z = 27 1/3 exp 



(jt/2 + 2ht) 



3 exp 



it + 4hi 



giving the three distinct solutions of 3e j(jl,6 \ 3e> X5 * /6) and 3e^' (3l/2) . 
(2) Let w = z 4 = 1 + j so \w\ = \ 2 and arg w = ill A. This gives us 



z = (V2)" 4 exp 



(ji/4 + 2hi) 



)l/8 



exp 



7T + 8&7T 
16 



giving the four solutions 2 1/8 exp [/(jt/16)], 2 1/8 exp [/(97r/16)], 2 1/8 exp [/(-7jt/16)] and 
2 1/s exp [/• (- 15jt/16)]. 

(3) Let w = z 3 = 5 (3 - 4j) so |w| = § = 1 and arg w = tan -1 (-5) = -0.927 radians to 3 d.p. 



So |z| = l and z = exp 



J 



(-0.927 + 2for) 



giving exp (-0.309/), exp (1.785/) and exp (-2.403/). 
The three roots in a + form to 2 d.p. are 

cos (-0.309) + j sin (-0.309) = 0.95 - 0.30/, 

cos (1.785) + j sin (1.785) = -0.21 + 0.98/, 

cos (-2.403) + 7'sin (-2.403) = -0.74 - 0.67/. 



(4) Let w = z 5 = -4 + 4/ so \w\ = \ 32 = \'2 5 and arg w = 3nlA, giving us |z| = (\'2 5 ) 1/5 
so that 



12, 



(3ji/4 + 2ht) 





/3tt + 8fer\ 


= \'2 exp 


•\ 20 jj 





1 2 exp 



giving \/2 exp [/(3jr/20)], \ 2 exp [/(l ljr/20)], \ 2 exp [/(19«/20)], \'2 exp [/(- 5?r/20)] and 
\/2 exp [/'(-13jt/20)]. 



(5) Let w = z 6 = -4 - 4\/3/' so |w| = v 16 + 48 = 8 and arg w = -2jt/3 so |z| = 8 
giving 



1/6 



)l/2 



z = \' 2 exp 



./ 



(-2jt/3 + 2for) 



/2 exp 



-n + 3for 
9 



with k = 0, -1, -2, 1, 2 and 3 to give the six distinct answers. 



(6) Let 



1 



v z+ 1 



= w. 



-1 has the four roots of e 7 " 74 , 



Now we know from the beginning of this section that w 4 
e>^\ e*- 71 ' 4 ) and e>' ( - 3jM \ 

w 4 = -4 will have the same four roots except that each one will be multiplied by \ 2. 
Writing the first of these in a + bj form gives V2 (cos (pi I A) + j sin (nlA)) = 1 + j. 
Similarly, the other three roots are -1 + j, 1 - j and -1 -j. 
Now, using the first root, we have 

z- 1 

= 1 +j so z - 1 = (z + 1)(1 +j) = z + zj + 1 +j so z = I/7 (-2 -7) = 2/ - 1 

z + 1 

multiplying top and bottom by -j. 
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The second root gives 

z-l=(z+l)(-l+j) = -z + zj-l+j so z(2-j)=j 
giving 



l. 



In a similar way, the other two roots come out as -1 -2j and f (—2/ - 1). You could save the 
extra working here by noticing that, if you had started by multiplying out, you would have 
got an equation for z with real coefficients. This means that its roots must come in 
conjugate pairs, which you will see these roots do indeed do. 

If you are at all doubtful about any of your sketches, use my solutions to draw yourself a new 
one so that you can check your own version. 

Exercise 10.D.2 

(1) a=l,b = -(3 - 2j) and c = -(1 + 3j) so 



(3 - 2j) ± V(3 - 2jf + 4(1 + 3j) (3 - 2j) ± \5 12/ • 4 • 12/ 

z = = 

2 2 

= ±(3 -2/ ±3) = 3 -j or -j. 

(2) a = 1, Z> = 3 - 4/ and c = -(22 + 6/') so 

-3 + 4/ ± V(3 - 4j f + 4(22 + 6j) -3 + 4/ ± V-7 - 24/ + 88 + 24/ 

z = = 

2 2 

= 5(-3+4/±9) = 3+2/ or -6 + 2/. 

(3) a = 1, Z> = 2 +7 and c = -(3 + j) so 

-(2 + 7) ± V(2 +jf + 4(3 +j) -(2 + 7) ± V3+4/+ 12 + 4/ 

z = = . 

2 2 

From (l)(b) from Exercise 1 O.C.I we know that the square roots of 15 + 8/ are 4 +j and 
-4 - j so we have 

-(2 +7) ± (4 +7) 



1 or -3-7. 



(4) a = 1, b = 1 and c = -(5 + 7) so 



-1 ± Vl +4(i+7) -1 ± V3 + 47 



2 2 

We know that the square roots of 3 + 4/ are ±(2 + 7) from (l)(a) from Exercise 10.C.1 so 
-1 * (2 +J) , 1 . 31. 

z = = 2 + lJ 0r ~2 ~ 2i- 

(5) a = 1, b = -1 and c = -1 + 3/ so 



1 ± Vl + 4 - 12/ 1 ± (3 - 27) 

z = = = 2 -j or -1 +7 

2 2 

using (l)(c) from Exercise 10.C.1. 
(6) a=\,b = -2(1 + 7) and c = 5(1 - 2/) so 

2(1 +7) ± V4(l +jf- 20(1 -2j) 2(1 +7) ± V-20 + 487 



2(1 + 7) ± 2V-5 + 12/ , 

" - (1 • /) ± V-5 + 12/. 
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Using the methods of Section 10. C. (a), the square roots of -5 + 12/ are ±(2 + 3j), or you 
could also get these from multiplying the two roots of 5 - 12/ from the previous answer by 
j. This gives 

z = (1 +7) ± (2 + 3/) = 3 + 4/ or -1-2/ 

(7) If we put w = z 2 , we have w 2 - (1 + j)w + j = 0. Using the formula gives 



1 +7± V(l +7) 2 -4/ I • / : \ 2/ 

w = = . 

2 2 

The square roots of —2j are 1 -y and -1 + j, either from using the methods of Section 
10.C.(a) or drawing a little picture as I have below. 




-21 

This gives w = \ (1 + j ± (1 - j)) = 1 or j so z 2 = 1 or j so z = ±1 or ± (1/\'2)(1 + j). 



Exercise 10.D.3 

(1) z = 1 + 2/ is also a root of f{z) = z 4 - 3z 3 + az 2 + bz - 30 = 0. 

Therefore (z - (1 - 2/)) (z - (1 + 2/)) is a factor of /(z). 

Multiplied together, this gives (z 2 - 2z + 5) is a factor, so now we have 

z 4 - 3z 3 + az 2 + bz - 30 = (z 2 - 2z + 5) (z 2 + pz - 6). 

Matching the terms in z 3 gives us -3z 3 = pz 3 - 2z 3 so p = -1 . Therefore 

z 4 - 3z 3 + az 2 + bz - 30 = (z 2 - 2z + 5)(z 2 - z - 6). 

Matching the terms in z 2 gives us az 2 = 5z 2 - 6z 2 + 2z 2 so a = 1. 
Matching the terms in z gives us bz = 12z - 5z so b = 7. 

The other two roots come from z 2 - z - 6 = (z - 3)(z + 2) = 0 so they are z = 3 and z = -2. 

(2) ^/ is also a root so (z -/) (z - (^/)) is a factor of the equation. 

Multiplying these gives us 

z 4 + az 3 + bz 2 - 4z + 13 = (z 2 + 1) (z 2 + /?z + 13). 

Matching terms in z gives -4z = pz so p = -4. 
We now have 

z 4 + az 3 + bz 2 - 4z + 13 = (z 2 + 1) (z 2 - 4z + 13). 

Matching terms in z 3 gives az 3 = -4z 3 so a = -4. 

Matching terms in z 2 gives te 2 = z 2 + 13z 2 so b = 14. 

The other two roots come from the solution of z 2 - 4z + 13 = 0. 

Using the formula (see Section 10.D.(b) if necessary) gives us the conjugate pair of 

z = 2 ± 3/ 
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(3) z = 1 + j is also a root of the equation /(z) = 0 so (z - (1 -7')) (z - (1 + j)) = z 2 - 2z + 2 is 
a factor of /(z). 
This gives us 

z 4 + az 3 + fe 2 + 1 = (z 2 - 2z + 2)(z 2 + pz + |). 

Matching terms in z gives 0 = -z + 2pz so /? = 5. 
We now have 

z 4 + az 3 + bz 2 + 1 = (z 2 - 2z + 2) (z 2 + jz + 5). 

Matching the terms in z 3 gives az 3 = -2z 3 + jz 3 so a = -5. 

Matching the terms in z 2 gives bz 2 = 2z 2 + \z 2 - z 2 so b = j. 

oil 9 

For the remaining two roots, we solve z + jz + 5 = 0 or 2z + z + 1 = 0. 
Using the formula gives the conjugate pair of z = \ (-1 ± v'7/). 

Exercise 10.E.1 

I show the sketch for each question below. 




lz(> 
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Exercise 10.E.2 

I show the sketches for each of the answers below. 

(5) is the impossible question. You can't have a length of -2 units! 



© 




IZ-3l= I 
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Exercise 10.E.3 



(1) (a) Putting z = x + jy gives 

\{x-\)+jy\ = \{x+\)+jy\ 

so v(x- l'f+y 2 = V(x + l) 2 + 
so (x - l) 2 + y 2 = (x + l) 2 + y 2 



so 0 = 4x or x = 0. 



(b) 



This means that Re (z) = 0 so z may lie anywhere on the imaginary axis. 

You can also think of this geometrically as all the points of intersection of lots of 
pairs of circles, each pair having the same radius but with centres at (1,0) and (- 1,0). 
This means that these points of intersection are all on the imaginary axis. 

z+ 1 

1so|z+1| = |2z-t| so \(x+l)+jy\ = \2x+j(2y-l)\ 



2z-j 

so v(x + l) 2 + y 2 = v4x 2 + (2y- if so (x + l) 2 + y 2 = Ax 2 + (2y - if. 
Tidying up gives 



so 



3x 2 - 2x + 3y 2 - 4y = 0 so 

(* - \f + (y - b 2 = § 



: x + y 2 - jy = 0 



2 2 2 
1 , 2 



so the path of z is a circle whose centre is at 3 + 3 j and whose radius is V 5/3 units. 

(2) (a) |z| < |z -j\ so |x +jy\ < \x +j(y - 1)| 

so x 2 + y 2 < x 2 + (y - if so 2y < 1 or y < \. 

This means that Im (z) < \ so z can lie anywhere in the region below the horizontal 
line y = 2- 

(b) |z| < V'2|z -j\ so |x < V2|x +j(y - 1)| 

so vx 2 + j 2 < V2 Vx 2 + (y - \f so x 2 + y 2 < 2(x 2 + (y - if ) 

which tidied up gives 0 < x 2 + y 2 - 4y + 2 or x 2 + (y - 2f > 2 

so z can be anywhere outside or on the boundary of the circle with its centre at 2j 
and a radius of \ 2 units. 

(c) 2\z\ < \2z -j\ so 2|x +jy\ < \2x +j(2y - 1)| 

so 2*jx 2 +y 2 <Ux 2 + (2y-lf so 4(x 2 + y 2 ) <4x 2 + (2y- if so 0<l-4vorj<|. 
We have Im (z) < \ so z can lie anywhere on or below the horizontal line y = \. 



(3) 



We have Re 



l+jz = l+jx-y. 



v z + 1, 

This means that x must be equal to zero since the LHS of this equation is real. So we 
can say 

z x+jy jy 

= = putting x = 0. 

z+1 ( v • I ) • Jy l+jy 

Tidying this fraction up by multiplying the top and bottom by the conjugate of the 
bottom gives 



jy 



1 -jy\ jy+y 



i+jy/\i-jy/ 1 • .«-' 

and we have 



so Re 



y 



z+1/ l+y 2 



y 



1 +y 2 



3 93.39 -3 

j-y so y 1 = 5 + 5 y - y - y 
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so 



2y 3 -y 2 + 2y-3 = 0. 



y = 1 is a solution, so we get (y - 1) (2y 2 + y + 3) = 0. 

Now 2y 2 + j + 3 = 0 has no real roots. Therefore, since y is real, there is just the 
one solution to the given equation of z = j. 

Exercise 10.E.4 

(1) We can't have z = 1 as this would give a zero underneath the fraction. 

(2) (a) Ifz = 2thenw = 2+./'. 

1 

(b) If z = 1 —j then w = — =j multiplying top and bottom by j. 

-j 



(3) w = u + jv = 



z+j x+j(y+l) 



z-1 (x-l)+jy 
Simplifying the fraction by multiplying it top and bottom by (x - 1) -j'y gives 



x+j(y+m (x-l)-jy 



(x- Y)+jy) \(x- \)-jy 



x 2 - x + y 2 + y 



so 



u = 



(x-\f+y 2 
(4) If u — v it must be true that x 2 



and v = 



1 -x+y 2 + y+j(x-y- 1) 

(x- l) 2 +y 2 
x—y—\ 



■ x + y + y 



(x-lf+y 2 
-y-l. 



Therefore the path of z is given by the equation x 2 - 2x + y 2 + 2y + 1 =0. 
This is the same as (x - l) 2 + (y + l) 2 = 1 so, if u = v then z must lie on the circle 
whose centre is at 1 - j and whose radius is 1. 

(5) I show the two Argand diagrams below. Again the little circle at z = 1 is to show that we 
have to exclude this point. 





J 








1 








1 






J — - . 


H 






-1 

Wj 




o i 





/ 









With u = v 




ArganJ dVcigram ^Viowin^g 
the Corre^poftdiog path ^br z 



(6) (a) If z, = -7 then Wl = 0. 
(b) If z 2 = 2 - j then w 2 = 



2(1 +j) 



1 -7 (1 -7) (1 +7) 



1 +j- 



1-7 2/ + 2 ! j 
(c) If z 3 = 1 - 2/' then w 3 = = = 5 + 5 j- 



-2j 



(d) Ifz 4 



I - \j then 



w 4 



2,4. 
5 



+ 57 2 + 47 



3 1 . 
5-5/ 



2 + 47)\ /-3+y 



-7 



-3+7 



10 - 107 

10 



-1 -j- 
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(e) w 5 




5 4 5 2 = - \ - \ j tidied up similarly. 

-5-sJ 




(f) w 6 = 3 — — = 2 + 2/ tidied up similarly. 

5 "Si 

I have shown all these pairs of points marked on the two Argand diagrams above. It looks 
as though the part of the z-circle which turns anticlockwise or positively from (0, -1) to 
the point which we aren't allowed of (1,0) is giving the positive values on u — v, and the 
part which turns clockwise or negatively between these two points is giving the negative 
values on u = v. 

It also looks as if the points are coming in the same order when they transfer from u = 
v to the circle of z, but we can see that equal distances apart on u = v don't mean that we 
get equal distances apart on the z-circle. The circle is made up of a kind of variegated 
elastic. 

You might like to see for yourself where points like w 7 = 10 + 10/ or Wg = — 10 — 10/ 
will go. (If you do this by first writing z in terms of w, you will find that this particular 
relationship has a rather special property.) 

(7) If v — 0 then from (3) we get x-j-l=0soz lies on the line y — x - 1. 

We just have to exclude the point (1,0) where z = 1. 

(8) If u = - v then from (3) we get 

x 2 -x+y 2 +y = -x+y + 1 or x 2 + y 2 = I. 

This means that z lies on the unit circle about the origin, again excluding the point (1,0) 
where z = 1 . 

(9) You have sufficient information above to check your two sketches. 

(10) Although the shapes of the paths of w and z are different, you will see from your two 
sketches that the angles at which these paths cut each other are unchanged. This is an 
example of an important general property of transformations like this. 

All the questions which you have answered here are just on the edge of a whole new 
area of maths. You will learn the general theory of what is happening here if you do a 
course on complex variables. 



Exercise 11 A 1 

(1) Since the particle is in equilibrium, the vector sum of the three forces acting on it is zero. 
Therefore they can be represented by the three sides of a triangle. I have drawn a sketch of 
this. You should always draw a sketch for this type of question. 



Notice that the arrows follow each other round, so giving zero displacement as the sum. 
It doesn't matter that the directions shown in the sketch may not be the true directions of 
the forces since the sizes of the angles between the directions will be correct. The 
orientation of the triangle doesn't affect these. 
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We are asked for the angle between the lines of action of the two largest forces. This is 
the obtuse angle at A. We find this by first finding /LBAC using the Cosine Rule. (See 
Section 4.B.(c) if necessary.) This gives us cos ,4 = (100 + 169 - 49)/260 so ,4 = 32.2° to 
1 d.p. and the required angle is 180° - 32.2° = 147.8°. 

Again, I have drawn a sketch. The three medians do all meet in a common point though we 
shall not need to use this result for this solution. 




We have AB = p, BC = q and CA = r. Also, BD = jq, CE = \r and AF = \p. So AD = 
p + jq and BE = q + \v and CF = r + jp. 

Therefore AD + BE + CF = |(p + q + r) which is the zero vector since the sum p + q + 
r gives zero displacement. Notice that this means that AD, BE and CF will also fit together 
to make a triangle! 

Again, I have drawn a sketch. Remember that a, b and c are not the vectors giving the sides 
of the triangle ABC but the position vectors of A, B and C from the origin, O. 




Let OM = m. Since M is the midpoint of AC, we have m = j(a + c) using the ratio theorem. 
We are told that G divides BM in the ratio 2 : 1 so, again using the ratio theorem, we 

> 12 12 1 1 

have OG = g = 3b + = 3b + 5 [5 (a + c)] = 5 (a + b + c). 

Since this answer is symmetrical in a, b and c, the position vectors of the points 2/3 of 
the way down the other two medians will also work out to the same result of 1/3 (a + b + 
c). Therefore the three medians of the triangle must intersect each other at the point G 
which lies 2/3 of the way down each of them from the corresponding vertex of the triangle. 

G is called the centroid or centre of mass of triangle ABC. It is useful to remember that 
its position vector from the origin is given byg = 3(a + b + c) where a, b and c are the 
position vectors from the origin of A, B and C. Notice that a, b and c are not the sides of 
the triangle. 



Answers to the exercises 



(4) Again, I have drawn a sketch. 




From this, we have k = p + PK but PK = [1/(1 -m)]PQ and PQ = -p + q so 

/ (/ - m)p + l(-p + q) /q-mp 

k = P + (-P + q) = ; = — 

I — m I — m I — m 

We now replace m by —m in this result to take account of KQ being in the opposite 
direction to PK and so PK : KQ = I : -m. Doing this gives us the same formula for k as 
we found for the internal division of PQ in Example (3). We now have 

mn + la 

k = — 

/ + m 

Exercise 11.A.2 

(1) First we draw the sketch as I show below. 




X, the magnitude of the component X of the velocity due east, is given by 
X= 42 cos 28° = 37.1 knots to 1 d.p. 

Y, the magnitude of the component Y of the velocity due north, is given by 
Y = 42 sin 28° = 19.7 knots to 1 d.p. 

(2) R = P + Q + S = (2i - 3j) + (4i + j) + (- 3i + 2j) = 3i. I show a sketch below. 




This vector addition works in exactly the same way as adding the complex numbers 2-3/, 
4+7 and -3 + 2j, so getting 3, and then showing this result on an Argand diagram as in Section 
lO.B.(a). 
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(a) We can write down the position vector from the origin of any point in two-dimensional 
space using multiples of f and g. 

In particular, r = 4i + 9j = (2i + j) + 2(i + 4j) = f + 2g. 

(b) Using multiples of g = i + 4j and h = 4i + 2j, it is again possible to find the position vector 
from the origin of any point in two-dimensional space. 

In particular, r = 4i + 9j = 2g + 5 h. 

If you can't spot this combination, you could find it in the following way. 

Let r = 4i + 9j = a (i + 4j ) + b (4i + 2j ) where a and b are numbers which we have to find. 

Matching up the i components gives us 4 = a + 4b. 

Matching up the j components gives us 9 = 4a + 2b. 

Solving these two simultaneous equations as in Section 2.C gives us a = 2 and b = 1/2. 
Therefore r = 2g + j h. 

(c) It is not possible to find the position vector of any point in two-dimensional space by using 
multiples of f = 2i + j and h = 4i + 2j because f and h are parallel to each other, with h 
= 2f. Only vectors which are also parallel to them can be described. In particular, since r 
is not parallel, it can't be described in terms of f and h. 

It is impossible to find numbers a and b such that 4i + 9j = a (2i + j ) + b (4i + 2j ) because the 
equations 2a + 4b = 4 and a + 2b = 9 have no solutions. 

The two pairs of f and g and g and h are both linearly independent but the pair f and h are 
linearly dependent since h = 2f. 

Exercise 11.A.3 

If v = 4i + 3j then |v| = ^4 2 + 3 2 - 5, an d v = 1/5 (4i + 3j). 

If w = 8i - 15j then |w| = ^8 2 + (-15) 2 = 17, and w = 1/17 (8i - 15j). 

Your sketch should be similar to my Figure 11. A. 12 but with a = 4, b = 3 and c = 12. 
Then |p| = ^4 2 + 3 2 + 12 2 = 13. The unit vector is p = 1/13 (4i + 3j + 12k). 

The resultant force F of P, Q and R is given by 

F = P + Q + R = (4i + 3j + 7k) + (2i + 9j - 7k) + (4i - 7j + k) = lOi + 5j + k. 

For the particle to be in equilibrium, we need to have an extra force of -F acting on it, so we 
need a fourth force of -lOi - 5j - k. 

We want the sum of p and q to be zero, with neither p nor q themselves equal to zero. An 
example of this is p = 2i - 3j and q = -2i + 3j. We just need any pair such that p = -q. 
If p and q are displacements, then making the displacement p followed by the displacement q 
brings you back to where you started. 

If p and q are forces, they are equal and opposite so, if they both act on a particle, they will 
have no net effect. If they act on a larger body, and their lines of action are parallel rather than 
identical, then they will have a turning effect on the body and form what is called a couple. 

There is an infinite number of possibilities here! 

One example is p = 2i + 3j - k, q = -3i - j + 4k and r = i - 2j - 3k. 

Since p, q and r have a vector sum of zero, we can say that r = -p -q. Therefore r can be 
described in terms of p and q and so these three vectors are not linearly independent. 
Geometrically, they form the sides of a triangle, since put together they give zero displacement. 
This means that they lie in a flat surface, a two-dimensional space inside our three-dimensional 
space. We can't describe all points in three-dimensional space in terms of them because we can't 
escape from this flat surface. 



1 j = 10 - 3- 2 = 5 ^v^VsOcos? 




so cos / = — == — ==■ and t = 75.9°. 
\/14 V30 



Answers to the exercises 



(b) Here b = 3a so the two vectors are parallel and the angle between them is zero. (The 
working out of a • b gives cos t = 1 so t = 0°.) 

(c) a • b = ^ -6 J - ^ 3 j = -12 - 18 + 30 = 0 

so cos t=0 and t = 90°, so a and b are perpendicular. 

(d) a • b = ^ -5 J • ^ lj = 3- 5- 6 = -8 = v / 38v r ncos? 

-8 

so cos / = — == — == and t = 1 13.0°. 

V38 Vn 

(e) Here b = -4a so the two vectors are parallel and in opposite directions so the angle between 
them is 180°. (The working out of a • b would give cos t = — 1.) 

(2) We have AB = -a + b = -7i + 5j - 3k and B~C = -b + c = 3i -7j + k and CA = -c + a = 4i 
+ 2j + 2k. 

The dot product of B~C and ~CA is (3 x 4) + (- 7 x 2) + (1 x 2) = 0 so the sides fiC and CA 
are perpendicular. Since the triangle is right-angled, its area A is given by pC||C4|. We 
have \BC\ = ^59 and \ CA\ = Jl4 soA= 18.8 square units to 1 d.p. 

(3) (a) Working as in Example (4) we have 

/ 6 \ / u \ 

0 so 6u + 4v + Iw = 0. 









(■I 


• 


: 






\ w / 









(;! 


• 








\ w 1 



If we let v = 9 then w = -12 and « = 8. (These are the smallest whole number 
solutions.) 



This gives n = 8i + 9j -12k and |n| = V289 = 17 
Therefor 
(b) We have 



Therefore n = ^ (8i + 9j - 12k) or, equally good, (-8i - 9j + 12k). 









(:! 


• 








\ w 1 











• 








\ w 1 



: 0 so 2w-4v-3w = 0. 

If we let u — 12 then v = 3 and w = 4. (These are the smallest whole number solutions.) 
This gives n = 12i + 3j + 4k and |n| = ^169 = 13. 
Therefore n = ^ (12i + 3j + 4k) or, equally good, ^ (-12i - 3j - 4k). 
(c) We have 

v 1 = 0 so 3w + 2v = 0 and I 3 I • I v 1 = 0 so 3v + 4w = 0. 









CI 


• 


: 






\ w I 









(!) 


• 








\ w 1 



If we let u ~ S then v = -12 and w = 9. 

This gives n = 8i - 12j + 9k and |n| = ^289 = 17. 

Therefore n = fj (8i - 12j + 9k) with -n an equally good solution. 



Exercise 11.B.2 



(1) axb 



i J 

1 1 
0 1 



i(l) +j(-l) + k(l) = i-j+k. 
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(2) a x b 



i j k 

3 1 2 
5 0 1 



i(l)+j(10-3) + k(-5) = i + 7j-5k. 



(3) a x b 



i j k 

2-7 3 
1 -2 -1 



i(7 + 6) + j(3 + 2) + k(-4 + 7) = 13i + 5j + 3k. 



(4) a x b = 



i j k 

1 -2 -1 
2-7 3 



i(-6 - 7) + j(-2 - 3) + k(-7 + 4) = -13i - 5j - 3k. 



(5) a x b 



i j k 

5 0-2 
3 0 4 



i(0)+j(-6-20) + k(0) = -26j. 



Since both a and b here lie in the x-z plane, their cross product is parallel to the y-axis. 



Exercise 11.B.3 



It's worth comparing your solutions to 1(a), (b) and (c) below with those for 3(a), (b) and (c) in 
Exercise 1 1 .B. 1 . The questions are identical but the methods of solution are different. 

(1) (a) A vector perpendicular to a and b is given by 



n = a x b = 



i j k 

0 4 3 
6 4 7 



= i(28 - 12) + j (18) + k(-24) = 16i + 1 8 j - 24k. 



From this we have |n| = 1 156 = 34 so the unit vector ft = p (16i + 1 8 j - 24k) = 
Tf (8i + 9j - 12k). 

(b) A vector perpendicular to a and b is given by 



n = a x b = 



1 j k 

-14 0 

2 -4 -3 



= i(-12) + j(-3) + k(4 - 8) = -12i - 3j - 4k. 



From this we have |n| = V 169 = 13 so the unit vector ft = (— 12i - 3j - 4k) or 
£ (12i + 3j + 4k). 



(c) A vector perpendicular to a and b is given by 

i j k 

3 2 0 
0 3 4 



n = a x b 



= i(8) + j(-12) + k(9) = 8i - 12j + 9k. 



From this we have |n| = y289 = 17 so the unit vector ft = jy(8i - 12j + 9k). 
(d) A vector perpendicular to a and b is given by 



n = a x b = 



i j k 

3 2 2 
-3 1 4 



i(8 - 2) + j(-6 - 12) + k(3 + 6) = 6i - 18j + 9k. 



From this we have |n| = \44l = 21 so the unit vector ft = ji (6i - 1 8 j + 9k) = 
f(2i-6j + 3k). 

(e) A vector perpendicular to a and b is given by 



n = a x b 



1 j k 

2 0 2 
2 4-5 



= i(-8) + j(4 + 10) + k(8) = -8i + 14j + 8k. 



From this we have |n| = ^324 = 18 so the unit vector ft = jg (-8i + 14j + 8k) 
I(_4i + 7j + 4k). 
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(f) A vector perpendicular to a and b is given by 



n = a x b 



i j k 

4 3 2 
-4 3 0 



i(-6) + j(-8) + k(12 + 12) = -6i - 8j + 24k. 



From this we have |n| = y 676 = 26 so the unit vector ft = 26 (~6i - 8j + 24k) = 
£(-3i - 4j + 12k). 



(2) R = Fj + F 2 + F 3 = 6i + j + k. 

Since F 1 acts through the origin, its moment M 1 about the origin is zero. M 2 , the moment 
of F 2 about the origin, is given by 



M, = 



i j k 

1 2 1 
3 4-4 



= i(-8 - 4) + j(3 + 4) + k(4 - 6) = -12i + 7 j - 2k. 



M 3 , the moment of F 3 about the origin, is given by 



M 3 



i j k 

1 0 -3 
1 -2 2 



•(-6) + j(-3 - 2) + k(-2) = -6i - 5j - 2k. 



The total moment of the three forces about the origin is given by Mj + M 2 + M 3 
-18i + 2j -4k. 



(3) We have a x b = c which gives us 



a x b 



1 j k 

x y z 

2 -1 1 



= i(y + z) + j(2z - x) + k(-x - 2y) = i - 3j - 5k. 



For the vectors to be equal, the separate components must be equal, so this gives us the 
three equations 

\ y + z=\ (1) 

2z - x = -3 (2) 
[x + 2y=5 (3) 

Now (2) + (3) gives 2z + 2y = 2 which is the same as (1) so there is more than one set of 
possible solutions to these equations. For example, x = 1, y = 2 and z = -1 will fit, giving 
a! = i + 2j - k, and so also will x = 3, y = 1 and z = 0, giving a 2 = 3i + j. There is no 
unique solution for a. 

We can see why this is by looking at a physical example. Suppose that b is a force. Then 
c = a x b is the equation which gives the moment c of b about the origin with a being the 
position vector of any point on its line of action. So we see that there are an infinite number 
of possibilities for a. 

If you draw a little sketch, you will see that the direction of the line of action of b is 
given by a 2 - a t = 2i - j + k. All the possible solutions for a are given by substituting 
different values for t in the equation a = i + 2j - k + t(2i - j + k). Notice that putting 
t = 0 gives a x and putting t = 1 gives a 2 . We look at equations of lines in more detail in 
Section ll.C.(a). 

(4) One possible vector which is perpendicular to u = 2i - j - 2k is v = 3i + 4j + k. Then a 
vector w which is mutually perpendicular to u and v is given by 



W = U X V 



1 J 

2 -1 

3 4 



k 

-2 
1 



7i - 8j + Ilk. 



Now | u | = -J9 = 3 and |v| = ^26 and |w| = ^234 so the three mutually per pend icular unit vectors 
are given by u = 1/3 (2i - j - 2k) and \ = \l^26 (3i + 4j + k) and w = 1/^234 (7i - 8 j + Ilk). 
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Exercise 11.B.4 



(1) Here is the working for u = (a x b) x c. We have 

2i - j - k so u = (a x b) x c = 



a x b = 



i j k 

1 2 0 
0-1 1 



1 j k 

2 -1 -1 
2 1 -2 



= 3i + 2j + 4k. 



Now u = 3i + 2j + 4k = 3(i + 2j) + 4(-j + k) = 3a + 4b so u lies in the plane of a and b. 
Next we find a x (b x c). Working out b x c first, we have 



b x c 



1 j k 

0-1 1 

2 1 -2 



= i + 2j + 2k so v = a x (b x c) 



i j k 

1 2 0 
1 2 2 



4i - 2j. 



Now v = 4i - 2j = 4(-j + k) + 2(2i + j - 2k) = 4b + 2c so v lies in the plane of b and c. 

(2) I will show the working out only for the answer to (1) given above. 

We have found that a x (b x c) = 4i - 2j. Now, working with column vectors, we have 



1 

a • c = 2 

V o 



4 and a • b 



So (a • c) b - (a • b) c = 4b + 2c = 4(-j + k) + 2(2i + j - 2k) = 4i - 2j = a x (b x c). 
The working using the results from my numerical example is exactly similar. 



Exercise 11. B. 5 

(1) We work out a x b first and then use this to find (a x b) • c. 



a x b 



i j k 

3-12 
1 -2 2 



i j k 

1 -2 2 

2 4-1 



= -6i + 5j + 8k so a • (b x c) = y -1 



2i - 4j - 5k so (a x b) • c = y 

(2) We work out b x c first and then use this to find a • (b x c). 
b x c 

(3) We work out axe first and then use this to find b • (a x c). 



2 
4 
-1 



~ 6 \ 

5 =-7. 





i 


j 


k 








a x c = 


3 


-1 


2 


= -7i + 7j + 14k so b • (a x c) = | 


.-•)• 


(" ; 




2 


4 


-1 






\ 14 



The answer to (3) has the opposite sign to the answers to (1) and (2) because of the flip of 
a and c. The letters no longer follow each other round in the order abcabc . . . 

(4) (a) a x b = d, say, where d is perpendicular to the plane in which a, b and c lie. So d x c 
lies in the plane, in a direction perpendicular to c. 
(b) (a x b) • c = 0 since d is perpendicular to the plane but c lies in it, and so d • c = 0 
The three vectors form a squashed box which has zero volume. 



Exercise ll.C.l 

(1) (a) Working with column vectors, we have 



y) V-3/ \2 



The components of the vectors are equal so x — 2 + 1 and y = -3 + 2t. From this we have 
t=x — 2 and so y = -3 + 2(x — 2) = 2x — 7. The Cartesian equation of this line is y = 2x - 7. 

618 Answers to the exercises 



(b) Working with column vectors, we have 

-0 -(>•(:' 

The components of the vectors are equal so x = 1 and y = -2 + At. The Cartesian equation 
of this line is x = 1. Since its direction vector is 4j, it is parallel to the y-axis. Each value 
of t gives a different point on the line. For example, putting t = 2 gives the point (1, 6). 

(2) (a) The general vector equation of a line is r = a + td. There are lots of possible ways for 
writing the equation of the line asked for here depending on which point A you choose on 
the line to give a and how you choose the direction vector d. 

We have y = 3x + 5. 1 will choose (0, 5) to be A so a = 5j. The gradient of the line is 
3 so I'll let d = i + 3j. 

This gives the vector equation of the line as r = 5j + t(i + 3j). 
(b) This line has the Cartesian equation of 2y = 5x - 3. I will choose the point (1, 1) to be A 
giving a = i + j. The gradient of the line is 5/2 so I'll take d = 2i + 5j. This gives the vector 
equation of the line as r = i + j + f(2i + 5j). 



Exercise 11.C.2 

(1) Let the point of intersection of the two lines be r = p. Working with column vectors, this gives 
us 




The separate components of the vectors are equal so we have 

1 + 3s = 6 + 2t (1) 
\-2s = t (2) 

Using (2) to substitute in (1) gives 1 + 3s = 6 + 2(1 - 2s) so s = 1 so t = -1. This gives 
p = 4i- j. 

(2) L x is given by r = i + 3j + t(2i + 3j ). Let L 2 be r = a + td. We know L 2 passes through r = 3i + 4j 
so we can choose a = 3i + 4j. 

Also, the lines are perpendicular so d • (2i + 3j) = 0. Therefore, a possible choice for d is 
-3i + 2j. (Or you could have 3i - 2j.) 

The vector equation of L 2 can be written as r = 3i + 4j + f(-3i + 2j). 



Exercise 11.C.3 



(1) This pair of lines is parallel because their direction vectors are parallel. 

(2) If this pair of lines meet at P with position vector p then we would have 




For this equation to have a solution, each of the three components of the vectors must be 
equal, so we get 



-1 + 2s 
-1 

5 - 3s 



4 + t 
-4 - t 
2 



(1) 
(2) 
(3) 



From (2) we have t = -3 and from (3) we have s = 1 . 

Now we must check whether these values fit (1). Substituting, we have the LHS 
RHS = 1 so therefore the two lines do meet at the point with p = i - j + 2k. 



1 and the 
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(3) If this pair of lines meet at P with position vector p then we would have 

'-(;Mi)-(i) + '(?)- 

For this equation to have a solution, each of the three components of the vectors must be 
equal, so we get 

f 2+5 =1 (1) 
3i =3 + 2? (2) 
I 1 + 4s = t (3) 

From (1) we have s = -1 which gives t = -3 from (2). 

Now we must check whether these values fit (3). Substituting, we have the LHS = -3 
and the RHS = -3 so therefore the two lines do meet at the point with p = i - 3j - 3k. 

(4) The two lines aren't parallel since their direction vectors aren't parallel. If this pair of lines 
meet at P with position vector p then we would have 

+ s 

For this equation to have a solution, each of the three components of the vectors must be 
equal, so we get 

f 1+5 = 2 + t (1) 
3-5 =6-3/(2) 
I 2 + 2s = 4 + 2t (3) 

(1) + (2) gives 4 = 8 - 2t so t = 2. Substituting this in (2) gives 5 = 3. (It's important 
to stay with the same two equations to find a solution for s.) 

Now we check whether these values also satisfy (3). Substituting gives the LHS = 8 and 
the RHS = 8 so therefore the two lines do meet at the point with p = 4i + 8k. 

(5) If this pair of lines meet at P with position vector p then we would have 

'-(iM-D-tiMi)- 

For this equation to have a solution, each of the three components of the vectors must be 
equal, so we get 

[ 1 =3+2/(1) 
-2-5 = -6 + 2t (2) 
[ 1 + 25 = 3 - t (3) 

From (1) we have t = —l. Substituting this value in (2) gives 5 = 6. 

Now we check whether these values fit (3). Substituting, we have the LHS = 13 and the 
RHS = 4. Therefore these equations have no solution and this pair of lines does not meet. 
Since they are not parallel either, they are skew lines. 

Exercise 11.C.4 

(1) (a) Letting r = xi + y\ + zk we have 




This gives us the three equations 

f x = 1 + 3? (1) 
>> = -2 + 2/ (2) 
I z = 5 - It (3) 
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and rearranging each of these in the form t = . . . gives us the Cartesian equation 
x — 1 y + 2 z — 5 5 — z 
3 2 -7 7' 

(b) Letting r = xi + jj + zk we have 

•-(;)-(•)•■(-:> 

This gives us the three equations 

x = 3+2? (1) 
y = -t (2) 
z = 5 + t (3) 

and rearranging each of these in the form t=... gives us the Cartesian equation 
1 

— (x - 3) = -y = z - 5. 
2 

(c) Letting r = xi + y\ + zk we have 

r -(i)-(i) +, (i> 

This gives us the three equations 
x = t (1) 

y = 2 (2) 

z = -3 - * (3) 

and rearranging (1) and (3) in the form t = . . . and putting y = 2 gives us the 
Cartesian equation 

x = -z -3 = - (z + 3) and y = 2. 

(2) (a) We start by putting all three expressions equal to t. This gives us 
x - 5 7 + 3 z 
2 4 5' 

Working with t, and each of the x, _y and z expressions in turn, gives us the three 
equations below for x, y and z. 

x = 5+2? (1) 
y = -3 + 4? (2) 
z = 5? (3) 

Writing these equations as components in column vector form gives us 

r -(i)-(i) + 'G) 

which is the vector equation of the line, 
(b) Again we start by putting all three expressions equal to t. This gives us 

t = x = = z + 1. 

3 

Working with t, and each of the x, y and z expressions in turn, gives us the three 
equations below for x, y and z. 

x = t (1) 
y = 2 + 3? (2) 
z = -1 + t (3) 
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Writing these equations as components in column vector form gives us 

■■(:)■(])•■(:) 

which is the vector equation of the line. 

(c) This time we put the first two expressions equal to t but, since z = 0, we must deal with 
it separately. We have 

t = 3x = 5 - y and z = 0. 

This gives us the three equations below for x, y and z. 

x = \t (1) 
y = 5-t (2) 
z = 0 (3) 

Writing these equations as components in column vector form gives us 

■-GMiM 3 .) 

We could write this vector equation more neatly as 

■■(-:)■(•)••(-:) 

1 

with s = — t. 
3 

(3) (a) AB = -a + b = -4i + 3j gives us a direction vector for this line. Since A lies on it, we 
can write its vector equation as r = 4i + ?(-4i + 3j). 
(b) For the Cartesian equation of this line, working in column vectors, we have 

■-(!)-(!)"(>} 

This gives us the three equations 

x = \-\t (1) 
y = 3t (2) 
z = 0 (3) 

Rearranging these gives us the Cartesian equation 
4 - x y 

= — and z = 0. 

4 3 

Exercise 11.C.5 

For each of these solutions, I shall use the vector equation of a plane in the form r = a + sd + te for 
reference. 

(1) We have d = 2i - 4j - 3k and e = - i + 4j. 

Now d x e gives n, a normal vector to the plane. 

This gives us n = d x e = 12i + 3j + 4k. (The working is similar to Exercise 1 l.B.3(l)(b).) 
Since |n| = 13 we have n = (12i + 3j + 4k). 

Also, for this plane we have a = 4i + j, so using r • n = a • n = D gives us 



1 / 








D = a*n = n'a = — 


:)■ 




I--- 


13 ' 






' 13 



The perpendicular distance from the origin to this plane is 51/13 units. 
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(2) We have d = 3j + 4k and e = 3i + 2j. 

Now d x e gives n, a normal vector to the plane. 

This gives us n = d x e = -8i + 12j - 9k. (The working is similar to Exercise 
ll.B.3(l)(c).) Since |n| = 17 we have A = ^ (-8i + 12j - 9k). 

Also, for this plane we have a = 2i - 3j - k, so using r • ft = a • ft = D gives us 



D = a*fl = fl'a = 




So \D\, the perpendicular distance from the origin to this plane, is 43/17 units. 

(3) We have d = 3i + 2j + 2k and e = -3i + j + 4k. 

Now d x e gives n, a normal vector to the plane. 

This gives us d x e — 6i - 1 8j + 9k. (The working is the same as Exercise 
ll.B.3(l)(d).) Since |n| = 21 we have ft = \ (2i - 6j + 3k). 

Also, for this plane we have a = 3j + 6k, so using r • ft = a • ft = D gives us 



£> = a*fi = fi«a = 




We have found that the origin actually lies in this plane. You can see that this will be so 
because, for this plane, a = d + e, meaning that the vector a itself lies in the plane. 



Exercise 11.C.6 

(1) (a) We have n = 3i + 2j - 6k so |n| = ^49 = 7 and ft = fn. Therefore, to write the equation of 

the plane in terms of ft, we must divide by 7. 

This then gives us ft • r = j (3x + 2y - 6z) = 6/7 = D. The perpendicular distance from 
the origin to this plane is 6/7 u nits. 

(b) We have n = 8i - 15k so |n| = ^289 = 17 and ft = -p^n. Therefore, to write the equation of 
the plane in terms of ft, we must divide by 17. 

This then gives us ft • r = (8x - 15z) = 12/17 = D. The perpendicular distance from 
the origin to this plane is 12/17 units. 

(c) We have n = 2i - j + 2k so |n| = ^9 = 3 and ft = 511. Therefore, to write the equation of 
the plane in terms of ft, we must divide by 3. 

This then gives us ft • r = 3 (2x - y + 2z) = 7/3 = D. The perpendicular distance from 
the origin to this plane is 7/3 units. 

(d) We have n = 4i - 7j - 4k so |n| = y f 81 = 9 and ft = \n. Therefore, to write the equation of 
the plane in terms of ft, we must divide by 9. 

This then gives us ft • r = <j (4x - ly - 4z) = 1 8/9 = 2 = D. The perpendicular distance 
from the origin to this plane is 2 units. 

(2) The vectors PQ and QR lie in the plane. Pg = -p + q = 2i-j + 6kand0tf = -q + r = 
— 2j + 6k. A normal vector to the plane is given by 

— > — > 1 J k 

PQxQR = 2-1-6 = 6i - 12j - 4k. 

0-2 6 

So we could most conveniently take n as 5 (6i - 12j — 4k) = 3i — 6j - 2k. 

Now we use the equation for the plane of n • r = n • a. Here, we know that P, Q and R 
all lie in the plane, so any of p, q or r could be used for a. 

I'll choose p. This gives 

\ / 3 \ / _1 \ 

-21 




so the Cartesian equation of this plane is 3x - 6y — 2z = —21. Check for yourself that you 
get the same answer if you use q or r for a. 
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(1) Suppose L and II meet at P with position vector p from the origin. Then p = xi + y\ + zk must 
satisfy the equation of the line. So 

But p also satisfies the equation of the plane so we have 

2(-2 + 30 + 3(5 - 0 - 4(1 + t) = 3 
which gives f = 4. Therefore p = lOi + j + 5k. 

(2) This time we have 

But p also satisfies the equation of the plane so we have 

2(3 + 20 + 3(-5 + 30 - 4(4 - At) = A 
which gives t = 1. Therefore p = 5i - 2j. 

(3) This time we have 

-(:)-(•)•'(-•)-(■•?,) 

But p also satisfies the equation of the plane so we have 
4(1 + 20 - 5(-30 + (-4 + t)=U 

3 7 

which gives t = 1/2. Therefore p = 2i - - jk. 

(4) This time we have 

p ^Mi) +, (iMiB 

But p also satisfies the equation of the plane so we have 
2(1 + 20 + 5(2 - 30 -3(-5 + 0 = 6 

5 7 

which gives t = 3/2. Therefore p = 4i - g - 5k. 



Exercise 11.C.8 

(1) The normal vectors to the two planes are given by iij = 2i - 5j + k and n 2 = i + 2j - k. 

Now, working out n x x n 2 gives 3i + 3j + 9k but it is neater to take d as i + j + 3k. 
The equations of the two planes are 2x - 5y + z = 4 and x + 2y - z = 5. 
Putting y = 0 in each of these gives 

' 2x + z = 4 (1) 
_x-z =5 (2) 

Adding (1) and (2) gives x = 3 so z = -2. Therefore the point with position vector 3i - 2k lies 
on both planes. We can now write the equation of the line of intersection as r = 3i - 2k + 
t(i + j + 3k). 

(2) This time the two normal vectors are parallel since n 2 = -3nj so the two planes are parallel. (We 
can check that they aren't in fact the same plane by multiplying the equation for n 3 by -3.) 
There is no line of intersection. 
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(3) The normal vectors to the two planes are given by iij = 4i - j + k and n 2 = 2i + 3j - 7k. 

Now, working out iij x n 2 gives 4i + 30j + 14k but it is neater to take d as 2i + 15j + 
7k. 

The equations of the two planes are 4x - y + z = 9 and 2x + 3y - 7z = 8. 
Putting z = 0 in each of these gives 

' Ax-y = 9 (1) 
2x + 3y= 8 (2) 

Adding (1) and -2 x (2) gives —ly = -7 so y — 1 so x = 5/2. Therefore the point with position 
vector (5/2) i + j lies on both planes. It is important to note here that we can't double up this 
vector to make it look nicer because it is a position vector and we mustn't alter its length. We 
can now write the equation of the line of intersection as r = (5/2) i + j + t (2i + 15j + 7k). 



Exercise ll.D.l 

In this exercise, I'll refer to each pair of lines using r = aj + sdi and r = a 2 + td 2 . 

(1) For the two lines given in 1(b), we have d 2 = -2d t so the lines are parallel. 

For the two lines given in 1(d), we have d x • d 2 = 0, so these two lines are perpendicular. 

For the two lines given in 1(a), we have aj • a 2 = 0 so the position vectors from the origin 
to the known points on each line are perpendicular to each other. The lines themselves are 
skew. 

For the two lines given in 1(c), we have a 2 = -2a! so the position vectors from the origin to 
the known points on each line lie on a straight line through the origin. The lines themselves are 
skew. 



(2) Suppose the acute angle between the two lines is 6. 
(a) We have 



|di • d 2 | = 



12 



= | -8 | = 8 = V«T \fl69 cos 6 



so cos 6 = 8/117 and 6 = 86.1° to 1 d.p. 
(b) This time we have 



1^ • d 2 | = 



= 3 = ^26 V54 cos 6 



so cos e = 3/(^26 ^54) and 6 = 85.4° to 1 d.p. 



Exercise 11. D. 2 

In my solutions, I'll refer to the normal vectors to each of the two planes as ii! and n 2 . 



(1) To find 9, the acute angle between the planes, we have 



| ni * n 2 | 



4 = V35 ^30 cos 6 



so cos e = 4/(V35 ^30) and 9 = 82.9° to 1 d.p. 



(2) This time we have 
|i>i • n 2 | = 



2 



= 0 



so the two planes are perpendicular. 
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(3) Now we have 
In! • n 2 | 



= | -4 | = 4 = ^59 V42 cos 9 



so cos 9 = 4/(^59 V42) and 9 = 85.4° to 1 d.p. 



Exercise 11.D.3 

In each solution, I am using the rule |n • d| = |n||d| sin 9. 

(1) We have n = 5i - 2j + k and d = 3i - 2j + k so 

|n • d| = | 20 | = 20 = ^30 y[l4 sin 9 
so 9 = sin- 1 (20/V30V~14) = 77.4° to 1 d.p. 

(2) We have n = 2i - 5k and d = i - 7j + 3k so 

|n • d| = | -13| = 13 = V29 ^59 sin 6 

so 9 = sin" 1 (13/V29 V59) = 18.3° to 1 d.p. 

(3) We have n = 3i - j + 2k and d = -5i + 7 j + Ilk. This time, we have |n • d| = 0 so sin 9 = 0 
so 9 = 0. Either this line lies in the plane or it is parallel to it. We know that the point with 
position vector i + j + 2k lies on the line. Substituting x= l,y = 1 and z = 2 in the equation 
3x — y + 2z = 5 of the plane gives the LHS = 6, so this point does not lie in the plane. Therefore 
the line is parallel to the plane. 



Exercise 11.D.4 

In all my solutions, I have let H be the foot of the perpendicular from P to line L with OH = h. In 
each case we have to find PH. 

(1) Since H lies on L we can say 

h = I 0 +/ 1 I = I t I for some value of t. 



Therefore 

> I 2 \ / -1 + 4f \ / -3 + At 

P//=-p + h = - -l + / = l+t 
\ 2 I \ 7 - 2t I \ 5 - 2t 

Now PH is perpendicular to 4i + j - 2k so we have 



-3 + At \ 




l + t • 


u 


5 -It 1 





This gives t = 1 so PH = i + 2j + 3k and | PH \ = t/14 = 3.74 units to 2 d.p. 

(2) This time we have 

/ 2\ /1\ / 2 + t \ 
h = -11 + t\ A = -11+4? for some value of t. 
\ -8 / V 3 / V -8 + 3/ / 

PH = -p + h = - -1 + -11+ At \ = \ -10 + At . 

V 5 / V -8 + 3t / \ -13 + 3t I 



626 



Answers to the exercises 



Now PH is perpendicular to i + 4j + 3k so we have 

I l+t \ ( l \ 
-10 + At • 4 = (1 + t) + 4(-10 + At) + 3(-13 + 3t) = 0. 

V-13 + 3*/ \3/ 

This gives t = 3 so PH = 4i + 2j - 4k and | PH \ = ^36 = 6 units. 
(3) Working as before, we have 

/1\ / 2\ /1 + 2A 
h = I 0 I + t\ 1 = t I for some value of t. 

/ -3 \ / 1 + 2A / A + 2t\ 

Now PH is perpendicular to 2i + j - k so we have 
/4+2A / 2\ 

y 2 + t J • y 1 J = 2(4 + 2t) + (2 + f) - (-2 - t) = 0. 

This gives t = -2 so P// is, in fact, the zero vector. P actually lies on line L so its distance 
from L is zero. The fast way to spot this answer was to notice that putting t = -2 'm the 
equation of L gives the position vector of P so P lies on L. 



Exercise 11. D. 5 

(1) In both (a) and (b) we find the shortest distance of A from II by working out | n • a - D\. 
(a) Working as in the example I have given in the text, the equation of II is 2x - y + 2z = 11 
so n — 2i - j + 2k and |n| = 3. 

Therefore n = (l/3)n and r • n = 11/3 so D = 11/3 units. 

Next, the distance of A from O in the direction of n is given by n • a. 

Working with column vectors, we have 



n • a = 




Therefore the shortest distance of A from II is |(-7/3) - (11/3)| = 6 units. 

(b) The equation of II is 3x + I2y - Az = 79 so n = 3i + 12j - 4k and |n| = y[l69 = 13. 
Therefore n = (l/13)n and r • ft = 79/13 so D = 79/13 units. 
Next, the distance of A from O in the direction of ft is given by ft • a. 
Working with column vectors, we have 



ft • a = 




Therefore the shortest distance of A from II is |(78/13) - (79/13)| = 1/13 units. 



(2) (a) We can write the equation of the line AL as r = i - 24k + t{2\ - 2 j + k). So, working 
with column vectors, we can say 

^(-i) +, (iM-£)' 

The line intersects the plane where r = 1. 

There we have a value of t such that 2(1 + 2t) 
also satisfy the equation of the plane. 

This gives t = 3 so 1 = 7i - 6j - 21k and AL = 



- 2(-2f) + (-24 + 0 = 5 since 1 must 
-a + 1 = 6i - 6j + 3k. 
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Since | AL \ = y 81 = 9, the perpendicular distance from A to II is 9 units. 

(This method provides an alternative way to find the shortest distance from a point to 
a plane. It also gives the position vector of the foot of the perpendicular from the point 
to the plane.) 

(b) We can write the equation of the line BS as r = i + 4j + 2k + f(2i - 2j + k). 
So, working with column vectors, we can say 




The line intersects the plane where r = s. 

There we have a value of t such that 2(1 + 2i) - 2(4 - 2t) + (2 + t) = 5 since s must 
also satisfy the equation of the plane. 

This gives t = 1 so s = 3i + 2j + 3k and B~S = -b + s = 2i - 2j + k. 

Since | BS\ = y[9 = 3, the perpendicular distance from B to II is 3 units. 

Notice that AL = 3BS meaning thatv4 and B are both on the same side of II and therefore 
it is physically possible to have a ray reflected from A to B by striking plane II 

(c) The two triangles ALM and BMS are similar because both triangles are right-angled and 
the angles at M are equal. Therefore ALIBS = LMIMS but ALIBS =9/3 = 3/1. 

Since M divides LS in the ratio 3 : 1, we have 

l(7i - 6j - 21k) + 3(3i + 2j + 3k) 

m = = 4i - 3k. 

4 

(d) We want to find AAML. We can do this using the dot product of AM and LM. 

We have AM = -a + m = 3i + 21k and LM = -1 + m = -3i + 6j + 18k. 




AM ' LM = y 0 j • | 6 | - 369 - x 450 , 369 cos AAML 
so AAML = 25.1° to 1 d.p. 



Exercise 11.D.6 



Working as in the example I have given in the text, we have a t = 3i + j + 7k and d x = j + 2k 
for line L x . 

For line L 2 , we have a 2 = -i + 3j + 2k and d 2 — 3i + 3j + 4k. 

Plane Ilj has L 1 and a shifted copy of L 2 lying in it. This plane has a normal vector of 
n = dj x d 2 . Working this out gives 



n = d x x d 2 = 



i J 

0 1 

3 3 



2i + 6j - 3k. 



Now |n| = V49 = 7 so n = 7 (-2i + 6j - 3k). 



TIj has the equation r • n = a x 



D l = $ (-6 + 6- 21) = -3. 



Plane YI 2 nas ^2 an d a shifted copy of L l lying in it and has the same unit normal vector n. 
Its equation is r • n = a 2 • n = D 2 = j(2 + 18 - 6) = 2. 

The shortest distance between the two skew lines is \D X - D 2 \ = | - 3 - 2| = 5 units. 
Working as in the example I have given in the text, we have a t — 3i + 2j - 5k and d t = 
2i + 4j - 5k for line L x . 

For line L 2 , we have a 2 = i - 3j - 5k and d 2 = i + k. 

Plane rij has L 1 and a shifted copy of L 2 lying in it. This plane has a normal vector of 
n = dj x d 2 . Working this out gives 



n = di x d 2 



i 

2 
1 



Now |n| = v '81 = 9 so n 



J 

4 - 
0 

|(4i 



4i - 7j - 4k. 



7j - 4k). 
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Ili has the equation r«n = a 1 , n = Z)] = g(12 - 14 + 20) = 2. 

Plane H 2 has Z 2 and a shifted copy of lying in it and has the same unit normal vector n. 



Its equation is r • n = a 2 



D 2 = s(4 + 21 + 20) = 5. 



The shortest distance between the two skew lines is \D l - D 2 \ = \2 - 5| = 3 units. 
(3) Working as in the example I have given in the text, we have a t = 4i - j - 2k and d x = 
4i + 4j + 3k for line L x . 

For line L 2 , we have a 2 = -i + 2j + 3k and d 2 = 4i - 3j + 3k. 

Plane ilj has L x and a shifted copy of L 2 lying in it. This plane has a normal vector of 
n = dj x d 2 . Working this out gives 



n = dj x d 2 



J 

4 

-3 



= -21i 



28k. 



Now |n| = Vl225 = 35 so n = j(3i - 4k). 

II l has the equation r«n = a 1 m A = D t = j(12 + 8) = 4. 

Plane Tl 2 has L 2 and a shifted copy of L x lying in it and has the same unit normal vector n. 
Its equation is r • n = a 2 • n = D 2 = j(-3 - 12) = -3. 

The shortest distance between the two skew lines is \D l - D 2 \ = |4 - (-3)| = 7 units. 



#Each of the problems in this exercise can also be solved by substituting directly 
into the general rule which I gave on p. 516. In each case, the shortest distance 
is found by calculating | (a t - a 2 ) • n |. 
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(Note: If you are searching for a broad topic, you may find it helpful to look at the very detailed contents list at the beginning of the 
book.) 



absolute value 236, 381-2 
acceleration 297-8, 372, 378, 418 
adjacent 133 

amplitude 198, 200, 449-50 

analysis, mathematical 300, 308, 349, 351, 

377 
angles 

acute and obtuse 152 

between lines and planes 508-12 

between two vectors 483 

in circles 156-7 

of elevation 134 

interior and exterior 139, 142, 172-3 

measurement of 135, 165-9 

in polygons 142, 172-3 

reflex 157 

in triangles 139 
angular frequency 200 
angular momentum 488 
angular velocity 183 
antidifferentiation 371, 375 
arc 154, 165, 169 

arccos, arcsin, arctan functions 186, 216 

Argand diagram 424 

argument see complex numbers 

arithmetic mean 232, 245 

arithmetic series (APs) 230-33 

common difference 230, 233 

sum of 231-2 
arrangements (permutations) 263-5 
Arrhenius' equation 414 
asymptotes 91, 111, 112, 184, 328, 341 
average 232 

base 27, 118-19, 307-8 
beats 205 

binary numbers 33-4 

binomial expansions 261-3, 266-72, 458 
coefficients for 262-3, 267-71, 444-6 
finding terms in 269-72, 276-9 
general terms of 267-9 
for negative and fractional powers 275-9 
and Pascal's Triangle 262-3, 269-71, 
444-6 



probabilities connected with 272-5 

and selections 266-7 
Binomial Theorem 283-4 
Bhoskara and Pythagoras' Theorem 138 
Boyle's Law 91, 92 

Cartesian coordinates 429 

centroid of a triangle 612 

chain letters 237-8 

Chain Rule 309-13, 384, 417 

changing the subject of a formula 43-5, 

109-10, 468 
Charles' Law 85, 92 
chord 154, 155 
circles 154-69 

angles in 156-7 

arc length 165 

area of 155 

circumference of 154, 155 
equations of 158-64, 353-6, 459-69 
parts of 154-5 

segment and sector areas 167-9 

tangents to 155, 160-4, 353-6 
circuits 212, 448-50 

see also current and voltage 
coefficient 74, 159 

see also binomial expansions 
combinations (selections) 265-6 
common difference see arithmetic series 
common ratio see geometric series 
completing the square 63, 102, 160, 393-4, 
465 

complex numbers 33, 201, 426 
adding and subtracting 430-1 
Argand diagram 424 
argument (arg) 427 
and circuits 448-50 
conjugates 436-7, 443, 455-7, 466-7 
cube roots of unity 435 
De Moivre's Theorem 444 
dividing 435-6 

equations involving 425-6, 428, 438-9, 

450-58 
exponential form 441 
in fractions 437 



imaginary 33, 424, 426 
modulus (mod) 426-7 
multiplying 431-4 
principal value of argument 427 
r, 0 form 429 

real and imaginary parts 426, 438 

and regions 458-69 

roots of 33, 425-6, 433-5, 450-58 

and transformations 460-69 

used for trig identities 444-7 
components of vectors 478-9 
compound interest 243-4, 301-304 
concentric 160 
cone 344, 416 
congruent triangles 140-51 
consistent equations 57, 498 
continuity 300, 351 
convergence see series 
coordinates 47, 429 
cos see trig functions 
cosh x 319 

see also hyperbolic functions 
Cosine Rule for triangles 149-53, 180 
counting numbers 30, 280 
cover-up rule (partial fractions) 252 
cross product 486, 489-90 
cubic equations 76-8, 455-8 
current (electric) 176, 197, 200, 208, 212, 
448-50 

curve sketching see graph sketching 
cycle 182, 198, 202, 204, 213, 450 
cyclic quadrilateral 174 
cylinder 45, 86, 344 

decimal numbers 33, 241-42 

degree (angle measurement) 135, 168-9, 

312-13 
De Moivre's Theorem 444 
denominator 16, 21 

lowest common 22 

rationalising the 37, 437 
dependent variable 46 
derivative 298, 315, 336, 364, 371 
difference equations 230, 244 
difference of two squares, 13 
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differential equations 298, 315, 409-21 
and chemical reactions 413-14, 418 
giving equations of motion 418-21 
and Newton's Law of Cooling 415-16 
and radioactive decay 411-13 
and rate of change of volume 416-18 
and separating the variables 409-11 
and SHM 296, 417-18, 447-8 
using x = e kt 315, 420, 447 

differentiation 293 
antiderivative 371, 375 
Chain Rule 309-13, 384, 417 
dash and dot notations 293-4, 298, 348, 
447 

derivative 298, 315, 336, 364, 371 
finding maximum and minimum 

338-40, 378 
of function of a function 309-13 
implicit 353-63 

of inverse functions, 331-4, 358-61 

point of inflection 338-40, 378 

possible problems 299-300 

Product Rule (uv) 313-15 

Quotient Rule (u/v) 315-17 

and rate of change 291, 318, 336, 339, 
353, 373, 413, 416 

second derivative 298, 315, 336, 372 

see also exponential, log, hyperbolic and 
trig functions 
direct proportion 84-90 
direction cosines 480 
direction vector 494 
discontinuities 113, 184, 195, 299-300, 

351, 381 
disintegration constant 411 
distance between two points 53-4 
divergence see series 
divisor and dividend 79 
domain 96, 114 
dot product 481-2 
dummy variables 246, 401 

e 122-4, 300-4 

connection with complex numbers 
438-41 

and compound interest 301-304 

numerical value of 122, 303 

see also exponential function 
ellipse 331 
enlargements 134 
equations 38-43 

of circles 158-64, 353-6, 459-69 

complex 425-6, 428, 438-9, 450-8 

and identities 72-5 

of motion 45, 372, 377-8, 418-19 

of planes 501-6 



simultaneous 56-60, 161 

of straight lines, 50-53, 493-501, 506 

see also quadratic equations and cubic 
equations and SHM 
equilateral 140 
Euclid and primes 35-6 
Euler and complex numbers 441, 442 
exp notation 122-3, 451 
exponential functions 116-18, 122-6, 
304-8, 318, 449-50, 451-3 

differentiating 305, 311, 361-2 

growth and decay 116-18, 124-6, 
305-7, 311 

integrating 384, 400 

inverse of 118, 124, 306 

series forms of 303, 365-6, 369, 440 

see also e and logs 



Factor Theorem 80-1 

factorials 227, 265 

factorisation 11-12, 14-15, 22, 62 

Fibonacci sequence 227, 229, 230 

focal length of a lens 44 

formula 

changing the subject of 43-5, 109-10, 
468 

for solving quadratic equations 65-6 
Fourier series 204, 406 
fractions 

adding and subtracting 20-4 

cancellation of 17-20, 25-6 

complex 437 

equivalent 17 

multiplying and dividing 25-6 

partial 249-58, 278, 395-7 

rationalising the denominator 37, 437 

top-heavy 255-7 
frequency 198, 200, 449-50 

angular 200 
functions 96 

domain of 96, 114 

even and odd 115, 177, 184 

exponential see exponential functions 

f(x) notation 76, 95 

of functions 104-6, 310 

hyperbolic see hyperbolic functions 

inverse 106, 306, 358-61 

as mappings 107 

maximum/minimum of see 
differentiation 

one-to-one 96, 106 

range of 96, 114 

reciprocal 133, 190-6, 321 

self-inverse 107 

stretching and shifting 96-104, 114, 
196-8 



trig see trig functions 
Fundamental Theorem of Calculus 375 



general solutions for trig equations 216, 

218, 220, 222 
general terms of series see terms 
geometric mean 245 
geometric series (CPs) 233-45, 301-4 

common ratio 233, 238 

sum of 234-5 

sum to infinity of 235-6 
Gibb's phenomenon 204 
gradians 165 
gradient 

of a curve 163, 291-2, 317, 334, 357, 

370, 373 
of perpendicular lines 54 
of a straight line 49-50, 287, 294-5 
graph sketching 
of circles 158-60 

general rules for curves 69, 111-14, 

195, 340-51 
of straight lines 50-1 
of hyperbolic functions 319-20, 325, 

327, 328-9 
of trig functions 176-9, 184, 187-9, 

193-4, 196-8, 202-5, 209-13 



half-life 412 
harmonic oscillator 200 
harmonic series 260, 368, 382 
hertz 198 
hexagon 172 
hyperbola 114, 331 
hyperbolic functions 318-34 

differentiating 321, 322, 331-4, 359-61 

identities using 320-3 

integrating 390-2, 396 

inverses of 323-30, 359-61 

reciprocals of 32 1 

relation with trig functions/O sborn's Rule 

322-3, 443 
series forms of 367, 369, 440 
sinfr 1 , cosh -1 and tanlr 1 see 

hyperbolic functions, inverses of 
hypotenuse 133 

identities 74-5, 76, 190-2, 208, 249, 253 
see also hyperbolic functions and trig 
functions 

imaginary axis see Argand diagram 
imaginary numbers see complex numbers 
implicit differentiation 353-63 
independent variable 46 
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indices see powers 
induction, proof by 279-85 
inflection, point of 338, 340, 378 
integers 30 
integration 376 

antiderivative 371, 375 

and area 373-79 

using completing the square 393-4 

constant of 375, 377 

definite integral 375, 376 

indefinite integral 376 

using inverse functions 391-4 

limits for 375, 379 

using partial fractions 395-7 

by parts 397-402 

using reduction formulas 402-6 

by substitution 384-9 

using the t = tan(x/2) substitution 406-8 
inv (calculator) 121, 136, 185, 324 
inverse functions see under functions 
inverse proportion 90-1 
irrational numbers 32, 66 
isosceles 139 



Kelvin temperature scale 90 

kinetic energy 87 

Kirchhoff 's Junction Rule 449 



l.c.m. 204 

Lami's theorem 473-4 
limits, limiting values 112, 171, 236, 274, 
291, 299, 599 
see also integration 
linear factor 254 

linear forms and logs 126-31, 414 

In 123, 306, 380-2 

locus 459 

logs 

antilog 410 

base 27, 118-19, 307 

changing bases 307-8, 313 

and differentiation 308-9, 313 

and integration 380-2, 388, 400, 404 

inverse of 118-19, 121, 306 

the (three) laws of 120-2 

and linear forms 126-30, 414 

In 123, 306 

natural 123, 306, 313, 361 

and series 367-9 
long division in algebra 79-80, 257 
lotteries 274-5 



Maclaurin series 363-9, 440 
mapping 107, 467-9 



maximum and minimum 338-40, 378 

applications of 343-8 
median 475 

midpoint of a line 47-8 
modulus see complex numbers 
moment of a force 487, 490 
moment of momentum 488 
momentum 488 



natural logs see logs 
natural numbers 280 
Newton's Law of Cooling 415-16 
Newton's Law of Gravitation 92 
Newton-Raphson Rule 348-53 
Normal distribution 102-4 
normal vector 503-4 
numerator 16 



odd functions 115, 177, 184, 328 
one-to-one functions 96, 106 
ordered pairs 47 
origin 47 

orthogonal vectors 482 
Osborn's Rule 322-3, 443 



parabola 61, 65 
parallelogram 141 
parameter 331 

partial fractions 249-58, 278 

in integration 395-7 
partial sum 237, 276 
Pascal's Triangle 262-3, 269-71, 444-6 
pendulum 43, 88, 200, 298 
perfect squares 36, 66 
perimeter 167 

period, periodic 177, 184, 197, 198, 200, 
204 

permutations (arrangements) 263-5 

phase angle 210 

phase difference 449-50 

phasor 450 

polar coordinates 429 

polygon 142 

position vector 47 1 

powers 7-8, 26-9, 120 

prime numbers 35-6 

principal value 
of argument 427 
of the inverse of cosh 327 
of trig functions 217, 219, 220, 222 

Product Rule for differentiation 313-15 

proof by induction 279-85 



Pythagoras' Theorem 137-8, 145 
Pythagorean triples 138, 434 
Pythagoreans and rational numbers 32, 138 



quadrant 154, 179, 428 

quadratic equations 60-72, 161-2 

completing the square 63 

with complex roots 425-6, 428, 433, 
454-5 

factorising 62 

the formula for solving 65-6 
the sum and product of the roots of 68, 
75, 433, 455 

quadrilateral 158, 173-4 

quotient (long division) 79 

Quotient Rule for differentiation 315-17 



radians 165-8, 366 

properties of small angles measured in 
169-71, 297 
radioactive decay 125, 411-13 
range 96, 1 14 

rate of change 291, 318, 336, 339, 353, 

373, 413, 416 
ratio theorems 474-5, 613 
rational numbers 30 
rationalising the denominator 37, 437 
real numbers 31-32, 423 
reciprocal functions see under functions 
recurrence relation 230, 244 
recurring decimals 242-3 
reduction formulas 402-6 
remainder from long division 79 
Remainder Theorem 80-1 
rhombus 141 

right-handed coordinate system 479 
roots 28-9 

of equations 64-5, 67-9, 78, 94, 162, 

422, 439. 450-8 
sign change showing presence of 351 
see also cubic equations, quadratic 
equations, trig functions and 
complex numbers 



scalar product 481-2 
scalar triple product 492-3 
scalars 470 

sectors and segments 155, 167-9 
selections (combinations) 265-6 
self-inverse functions 107 
separation of variables (differential 

equations) 409-11 
sequence 226 
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series 231 

comparison of 259-60 

convergence and divergence of 236-7, 

239-40, 250, 251, 258-60, 303, 

366, 369, 382 
giving functions 363-9, 440, 443 
partial sums of 237 
summing 231, 247-51, 258-60 
see also arithmetic series, geometric 

series, binomial expansions and 

terms 

shift (calculator) 121, 136, 185, 324 
SHM (simple harmonic motion) 183, 

198-200, 297-8, 419-20, 447-8 
X (sigma) notation 246-7 
similar shapes 140 
simultaneous equations 56-60, 161 
Sine Rule for triangles 146-8, 180 

ambiguous case 539-40 
sinh x 319 

see also hyperbolic functions 
sketching curves see graph sketching 
skew lines 516-17 
slope see gradient 
sphere 45, 88, 344 
square roots 28, 31-2, 36-7, 116 

of complex numbers 33, 438-9, 454 

simplifying 36 

y2 is irrational 3 1 -2 
stationary value 336, 341 
steepness see gradient 
straight lines 

equations of 50-53 

gradients of 49-50 
stress and strain 86 
subscript 227 

sum to infinity 235-6, 240, 250, 251, 

275-6 
superscript 365 
surds 37 



tan see trig functions 
tangent 

to a circle 155, 160-4, 353-6 



to a curve 291-2, 317, 330, 334-6, 348, 

357- 8 
tanhx 321 

see also hyperbolic functions 
Taylor's Theorem 364 
terminal velocity 418, 419 
terms 

comparing like 75, 76, 77-8, 253 
of sequences 226 

general terms of series 227, 230, 234, 
247, 268-9, 365-8 

ratio of successive 234, 238, 240, 258-9 
torque of a force 487-8, 490 
transforming functions 96-104, 114, 202-5 
translation 100 
triangle inequality 43 1 
triangles 

area of 137, 148-9 

centroid of 612 

congruent 140-1 

equilateral and isosceles 139-40 

similar 140, 142-3, 347, 417 

sum of interior angles 139 
trig functions 

arcsin, arccos and arctan 186, 216 

differentiating 295-7, 309, 332-3, 

358- 61 

in equations 215-25 
identities using 144-6, 190-2, 322-3, 
444-7 

integrating 389-94, 402-4, 405, 406-8 
inverses of 184-9, 216-7, 358-61, 
400 

principal values of 217, 218, 220, 222 
reciprocals of 133, 190 
relationship with e of 440-3 
sec, cosec and cot 133, 190 
series forms of 366, 369, 440 
sin, cos and tan (Z < 90°) 133, 143-4 
sin, cos and tan of any angle 179-82 
sin -1 , cos -1 and tan -1 see functions, 
inverses 

stretching and shifting 196-8, 225 
see also wave functions 
turning points 336, 343-8 



unit circle 441, 449, 451 

unit vectors 476, 480, 484-5, 490-1 



vector product 486, 489-90 
vector triple product 491-2 
vectors 471 

adding 471-3 

components of 478-9 

cross product of 486, 489-90 

direction 494 

dot product of 481-2 

length of 477, 479 

linearly independent 476, 481, 614 

magnitude of 477, 479 

normal 503-4 

orthogonal 482 

perpendicular 482, 484, 490 

position 471 

scalar product of 481-2 

scalar triple product of 492-3 

subtracting 473 

unit 476, 480, 484-5, 490-1 
velocity 292, 296, 298, 372, 373-4, 378, 
379, 418 

angular 183 

terminal 418, 419 
vertex 172 

voltage 197, 200, 208, 212, 448-50 



wave functions (waveforms) 176, 179, 185, 
197, 200, 202-5, 208, 212, 406, 
449-50 



Young's Modulus 87 



zero 30, 31, 61 

dividing by 18, 164, 193, 356, 359, 360 
zero vector 471, 489 
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